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PREFACE 


How and what should we teach today’s undergraduates to prepare them for 
careers in mathematically oriented areas? Furthermore, how can we amelio- 
rate the quantum leap from introductory calculus and linear algebra to more 
abstract methods in both pure and applied mathematics? There is a subject 
which can take students of mathematics to the next level of development and 
this subject is, at once, intuitive, calculable, useful, interdisciplinary, and, 
most importantly, interesting. Of course, I’m talking here about Differential 
Geometry, a subject with a long, wonderful history and a subject which has 
found new relevance in areas ranging from machinery design to the classifi- 
cation of four-manifolds to the creation of theories of Nature’s fundamental 
forces to the study of DNA. 

Differential geometry provides the perfect transition course to higher math- 
ematics and its applications. It is a subject which allows students to see 
mathematics for what it is — not the compartmentalized courses of a stan- 
dard university curriculum, but a unified whole mixing together geometry, 
calculus, linear algebra, differential equations, complex variables, the calculus 
of variations, and various notions from the sciences. Moreover, differential ge- 
ometry is not just for mathematics majors, but encompasses techniques and 
ideas relevant to students in engineering and the sciences. Furthermore, the 
subject itself is not quantized. By this, I mean that there is a continuous 
spectrum of results which proceeds from those which depend on calculation 
alone to those whose proofs are quite abstract. In this way students gradually 
are transformed from calculators to thinkers. 

Into the mix of these ideas now comes the opportunity to visualize con- 
cepts and constructions through the use of computer algebra systems such as 
MAPLE and MATHEMATICA. Indeed, it is often the case that the conse- 
quent visualization goes hand-in-hand with the understanding of the mathe- 
matics behind the computer construction. For instance, in Chapter 5, I use 
MAPLE to visualize geodesics on surfaces and this requires an understand- 
ing of the idea of solving a system of differential equations numerically and 
displaying the solution. Further, in this case, visualization is not an empty ex- 
ercise in computer technology, but actually clarifies various phenomena such 
as the bound on geodesics due to the Clairaut relation. There are many other 
examples of the benefits of computer algebra systems to understanding con- 
cepts and solving problems. In particular, the procedure for plotting geodesics 
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can be modified to show equations of motion of particles constrained to sur- 
faces. This is done in Chapter 8 along with describing procedures relevant to 
the calculus of variations and optimal control. At the end of Chapters 1, 2, 
3, 5, 7, and 8 there are sections devoted to explaining how MAPLE fits into 
the framework of differential geometry. I have tried to make these sections 
a rather informal tutorial as opposed to just laying out procedures. This is 
both good and bad for the reader. The good comes from the little tips about 
pitfalls and ways to avoid them; the bad comes from my personal predelictions 
and the simple fact that I am not a MAPLE expert. What you will find in 
this text is the sort of MAPLE that anyone can do. Also, I happen to think 
that MAPLE is easier for students to learn than MATHEMATICA and so I 
use it here. If you prefer MATHEMATICA, then you can, without too much 
trouble I think, translate my procedures from MAPLE into MATHEMATICA 
or you can look at A. Gray, Differential Geometry of Curves and Surfaces, for 
a huge number of MATHEMATICA geometry procedures and examples. 

In spite of the use of computer algebra systems here, this text is traditional 
in the sense of approaching the subject from the point of view of the 1800’s. 
What is different about this book is that a conscious effort has been made 
to include material that I feel science and math majors should know. For ex- 
ample, although it is possible to find mechanistic descriptions of phenomena 
such as Clairaut’s relation or Jacobi’s theorem and geometric descriptions of 
mechanistic phenomena such as the precession of Foucault’s pendulum in ad- 
vanced texts (see V.I. Arnol’d, Mathematical Methods of Classical Mechanics 
and J. Marsden Lectures on Mechanics), I believe they appear here for the 
first time in an undergraduate text. Also, even when dealing with mathe- 
matical matters alone, I have always tried to keep some application, whether 
mathematical or not, in mind. As an example of this, J would note that the 
Weierstrass-Enneper representation is not discussed merely for its own sake, 
but for its use in solving the problem of Bjorling and because it is essential 
to understanding why (a portion of) Enneper’s surface is minimal, but not 
area minimizing. In fact, I think this last topic helps to show the boundaries 
between physics (e.g. soap films) and mathematics (e.g. minimal surfaces) as 
well as being amenable to the power of MAPLE (see Chapter 7). 

This book originally began as an attempt to fashion a one-quarter course 
in differential geometry. In fact, I have taught such a course for mathematics, 
physics, engineering, chemistry, biology and philosophy majors and I have 
used topics from Chapters 1, 2, 3, 4, 5, 6 and 8. This does not mean that I 
have covered these chapters exhaustively in one quarter, but that I have chosen 
certain parts to emphasize and allowed students to do projects, say, involving 
other parts. For example, students have done projects on involutes and gear 
teeth design, re-creation of curves from curvature and torsion, Enneper’s sur- 
face and area minimization (see above), geodesics on minimal surfaces, and 
the Euler-Lagrange equations in relativity. In many cases, students have gone 
way beyond what this book contains and I owe them my thanks for expanding 
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my knowledge. The book, as it now stands, is suitable for either a one-quarter 
or one-semester course in differential geometry as well as a full-year course. 
In the case of the latter, all chapters may be completed. In the case of the 
former, I would recommend the chapters I’ve listed above, but there is a good 
choice of alternative material as well. 

The reader should note two things about the layout of the book. First, the 
exercises are integrated into the text. While this may make them somewhat 
harder to find, it also makes them an essential part of the text. The reader 
should at least read the exercises when going through a chapter — they are 
important. Secondly, I have chosen to number theorems, lemmas, and the 
like separately from examples, definitions, and remarks. I did this to avoid 
having Lemma, 27 followed by Theorem 82, but of course this makes it harder 
to find the examples. Therefore, I have included a list of examples directly 
before the solutions to problems at the back of the book. 

There are several students who deserve special mention with regard to this 
text. Rob Clark first interested me in minimal surfaces and, together with 
Jack Chen, showed me the use of computers (e.g. Ken Brakke’s Evolver pro- 
gram) in distinguishing “minimal” from “harmonic”. Laszlo Ilyes provided 
many of the MAPLE procedures for optimal control while Carrie Kyser Heide 
took my original laughable “geodesic procedures” and transformed them won- 
derfully into one elegant procedure which does exactly what I want it to do. 
Sue Halamek did an excellent job on the first draft of the solutions to prob- 
lems and any present errors are certainly due to my final editing. Thanks to 
you all and to all the students who watched me fumble my way to a book! 

I would also like to acknowledge the contributions of my friend Allen 
Broughton. It was Allen who first taught a course from a sheaf of my hand- 
written notes and actually made sense of the notes and a success of the course. 
Allen also first explored the use of MAPLE for differential geometry and is 
responsible for producing the first procedures for calculating curvatures etc. 
Similarly, the handwritten notes referred to above would have remained just 
that without the TRXpertise of Joyce Pluth. Joyce typed the first draft of 
those notes and patiently tutored me in the intricacies of TRX until I stopped 
bothering her. Let me also thank Elaine Hoff and Dena Jones for helping 
me to photocopy, collate, cut, and paste to ready versions of this text for 
unsuspecting classes. I must also thank the members of the Cleveland Ge- 
ometry/Topology Seminar for sitting through numerous lectures on various 
parts of this text. 

Finally, the writing of this book would have been impossible without the 
help, advice, and understanding of my wife Jan and daughter Kathy. Thanks 
— the computer is now free! 


John Oprea 
oprea@math.csuohio.edu 


Chapter 1 


THE GEOMETRY 
OF CURVES 


1.1 INTRODUCTION 


A curve in 3-space R° is a continuous mapping a: I — R® where I is some 
type of interval (e.g. (0,1), (a,b), [a,b], (—oo, a], [0,1] etc.) on the real line 
R . Because the range of a is R?, a’s output has three coordinates. We then 
write, for ¢t € J, 


a(t) = (a'(t), a7(z), a(t) 


where the a’ are themselves functions a‘: J + R. A useful way to think 
about curves is to consider t to be time and a(t) to be the path of a particle 
in space. We say a is differentiable or smooth if each coordinate function a’ 
is differentiable as an ordinary real-valued function of R. The velocity vector 
of a at to is defined to be 


where da‘/dt is the ordinary derivative and |,~:, denotes evaluation of the 
derivative at t = to. We shall also write da‘/dt(to) for this evaluation when 
it is convenient. In order to interpret a’ geometrically, we introduce the 


Example 1.1: First example of a curve: A line in R°. 

We know that two points determine a line. In the plane R?, this leads to the 
usual slope-intercept equation which gives an algebraic description of the line. 
Unfortunately, in R? we don’t have a good notion of slope, so we can’t expect 
exactly the same type of algebraic description to work. For two points, p and 
q, the line | joining them may be described as follows. To attain the line, add 
the vector p. To travel along the line, use the direction vector g — p since this 
is the direction from p to g. A parameter t tells exactly how far along q — p 
to go. Putting these steps together produces a curve which is a parametrized 
line in R3, 

a(t) =p+t(q—p). 
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For instance, if p = (1, 2,3) and g = (—1,4, —7), then g — p = (—2, 2, —10), so 
the line through p and q is given by the curve 


a(t) = (1,2,3) + t(—2, 2, -10) 
= (1 — 2t,2+ 2t,3— 104). 


FIGURE 1.1. A parametrized line 


EXERCISE 1.1. What is the parametrization of the line through (—1,0,5) and 
(3, -1, —2)? 


EXERCISE 1.2. In R4, what is the parametrization of the line through (—1, 6, 5,0) 
and (0,1, —3,9)? 


So we see that, given a line a(t) = p+t(q—p), a’(t) = q—p. Since this 
direction vector q — p is then the velocity vector, we often write 


a(t) = p+ tv. 


Now that we understand lines, we can ask what the picture of the velocity 
vector is for any curve a(t)? The definitions give 


d 1 
a’ (to) = [fe 
t=to 


7 ( him 2) = e*(to) 5... OC) = o7(to) 4, 20) = o)) 


t—to t—to ” toto t — to ” toto t—to 


da 
"dt 


da? 
p24, "dt 


t=to 


aim ( (atlt):020),08(8) = (ato), 02 (to), 0°(to)) 
t — to 
SG ee ata). 
t—to t — to 
This looks much like our usual “slope” definition of the derivative. In fact, 
a(t) — a(to) is exactly the vector shown below. Note that the approximating 


1.1 Introduction 3 


FIGURE 1.2. A curve with tangent vector 


vectors inside the limit always point in the direction of increasing t. For t < to, 
this follows since t — tg is negative. 

Now, t — tg, so the vector moves closer and closer to the tangent vector to 
the curve at a(to). Hence, the velocity vector a’(to) to a at to is exactly the 
tangent vector to a at a(to). Notice that we obtain 2 vector with a precise 
length, not a line. Remember that a vector v = (v!,v?, v3) € R3, has a length 
given by the Pythagorean theorem, 


|v] = V(u")? + (v?)? + (v9)?. 


EXERCISE 1.3. Show that |v| is the distance of (v', v?,v®) from (0,0,0) by two 
applications of the Pythagorean theorem. 


In our situation, |a’(t)| = 1/ (da! /dt)” + (da?/dt)? + (da3/dt)? is simply the 
speed of a. Again, thinking of a as the path of a particle and ¢ as time, we 
see that the length of the velocity vector is precisely the speed of the particle 
at the given time. For a line, a(t) = p+tv, the speed is simply |v|, the length 
of the direction vector. 

If a(t) = (a'(t), a?(t), a(t) is a curve in R®, then its acceleration vector 


is given by : : ; 
d d d 
Le ee cs | ao 4 te es 
a(t) = Ge (t), qe (t), 7" )) 


EXERCISE 1.4. Newton asked the question, what is the curve which, when revolved 
about an axis, gives a surface offering the least resistance to motion through a “rare” 
fluid (e.g. air)? Newton’s answer is the following curve (see Chapter 8). 


GW.) = (3 Fae : in (¢) $4 ot = ZA) 


where A is a constant and x > 2X. Graph this curve and compute its velocity 
and acceleration vectors. What angle does the curve make with the z-axis at the 
intersection point (2A, 0)? 
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We now come to the first (and simplest nontrivial) instance of calculus im- 
posing a constraint on the geometry of a curve. In a real sense, this is what 
the differential geometry of curves is all about and we will see many more 
examples of it later. 


Proposition 1.1. The curve a is a straight line if and only if a” = 0. 


Proof. If a(t) = p+ tv is a line, then a’(t) = v (which is a constant vector), 
so a(t) = 0. 

If a(t) = 0 for all t, then d?a*(t)/dt? = 0, for each coordinate function 
a‘(t). But a zero 2"¢ derivative simply means that da‘(t)/dt = v' is a con- 
stant. We may integrate with respect to t to obtain a*(t) = p’ + v't where p' 
is a constant of integration. Then 


a(t) = (p' + tu’, p? + tv”, p® + tu?) =p+tv 
with p = (p', p?, p*) and v = (v!, v2, v3). Hence the curve a may be parame- 
trized as a line. O 


This easy result indicates how we will use calculus to detect geometric 
properties. Just to see how far we can get using these elementary ideas, let’s 
consider the following question. What is the shortest distance between two 
points p, g € R°? We have been taught since we were children that the answer 
is a line, but now we can see why our intuition is correct. Calculus tells us 
that distance is simply the integral of speed. Therefore, the integral 


b 
L(a) = i la! (t)| dt 


calculates the arclength of a curve a: [a,b] + R®° from a(a) to a(b). In order to 
see this, consider a small piece (“an infinitesimal” ) of arclength approximated 
as in Figure 1.3. 


FIGURE 1.3. An infinitesimal piece of curve 
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The Pythagorean theorem then says that 1 = \/dx? + dy?. Suppose both x 
and y are parametrized by t. Then a(t) = (x(t), y(t)) and dx? = (dx/dt)* dt? 
and dy? = (dy/dt)* dt?. Finally, 


dz\* (dy \? 
[= — — | dt 
(a) +z) 
expressing the relation, distance equals rate times time. Now we add up all 
the pieces of | by integration to get, 


uo [ (2) +(@ 


= fro dt. 


The three dimensional version of this intuitive description simply uses our 
earlier exercise on the Pythagorean theorem in three dimensions. Now, the 
length of the vector v, |v|, may be written ,/v-v, where - denotes the dot 
product of vectors. Recall that, in general, v- w = v!w! + vw? + v°w3, for 
v = (v', v7, v3) and w = (w!, w?, w3). We also have the following 


Proposition 1.2. The dot product is computed to be v- w = |v| |w|cos6, 
where @ is the angle between the vectors v and w. 


Proof. From Figure 1.4, we see that the usual properties of vectors give u = 
v —w. Hence 


jul? = u-u=(v—w):(v-—w) 
=v:v-2v-w+w-w 


= |v|? + wl? — 2v-w. 


FIGURE 1.4 
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But the Law of Cosines for this triangle says |u|? = |v|? +|w|? —2|v| |w| cos 9. 
Equating the two righthand sides and eliminating like terms produces 


Vv - w= |v||w|cosé. 
D 


Corollary 1.3 (Schwarz’s Inequality). The dot product obeys the in- 
equality, 
|v -w] < ]v]|w]. 


Proof. Simply note that | cos 6| < 1. 0 


We note here that, with respect to the derivative, the dot product behaves 
just as ordinary multiplication — namely, the Leibniz (or product) rule holds. 
In particular, 


Proposition 1.4. If a(t) and G(t) are two curves in R?, then 


d(a-8) _ da dB 
a de ae 


Proof. We use the ordinary product rule on the component functions, 


d(a-B) ad 11 2 92 3 93 
sp (dat a, cd! 
= (Fo +0 7) 
ie da’ ., , a2 
-¥(o+De) 
_ da dZ 


O 


When the parameter t is understood, we will write the product rule as (a-3)’ = 
a’: 8+a-('. We are now able to answer the question of which path between 
two points gives the shortest distance. 


Theorem 1.5. In R®, a line is the curve of least arclength between two 
points. 


Proof. Consider two points p, g € R°. The line between them may be param- 
etrized by I(t) = p+ t(q — p), where q — p is the vector in the direction from 
p to qg. Then l’(t) = q — p and |I’(t)| = |q — p|, a constant. Therefore, 


1 1 
1) = f Wola = lap [ #195 
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Dp 


FIGURE 1.5. Line and comparison curve 


and the length of the line segment (or direction vector) from p to q is the 
distance from p to q (as of course we expected). Now consider another curve 
a which joins p and q. 

We want to show that L(a) > L(l) and, since a is arbitrary, this will 
say that the straight line minimizes distance. Now, why should a be longer 
than /? One intuitive explanation is to say that a starts off in the wrong 
direction. That is, a’(a) is not “pointing toward” g. How can we measure 
this deviation? The angle between the unit vector in the direction of ¢ — p, 
u=q- p/ lg — p| and a’(a), the tangent vector of a at p, may be calculated 
by taking the dot product a’(a)-u. The total deviation may be added up by 
integration to give us an idea of why L(a) > L(l) should hold. Precisely, we 
compute f : a’(t) - udt in two ways to obtain the inequality. Now, we have 
(a(t) -u)’ = a’(t)- u+a(t)-u’ = a’(t)- u since u = constant vector. Also, 
the Fundamental Theorem of Calculus gives Vie df /dt dt = f(b) — f(a), so 


b b 
[@-ua= / (a(t) -u)’ dt = a(b)-u—a(a)-u 


=q:u-—p-u since a(a) =p, a(b)=q 
=(q—p)-u 
(1) _@ zs P) 
_ |a—pl' 
lq - pl 
=|q-pl 


= L(I) 
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the straightline distance from p to g. Consequently, 
b b 
/ a’(t)-udt < / |a’(t)| - |ul de by the Schwarz inequality 
a a 


b 
(2) =p |a’ (t)| dt since |u| = 1 
= L(a) 


Combining (1) and (2), we have L(l) < L(a). Note also that a‘(t)-u = 
\a’(t)| - |u| only when cos@ = 1, or 9= 0. That is, a’(t) must be parallel to 
q—p for all t. In this case a is the line from p to g. Therefore, we have the 
strict inequality L(l) < L(a) unless a = line. La] 


So, using the first notions of calculus, we have seen how to identify lines in 
R®° and to determine exactly why lines give the shortest distance between two 
points. Of course, R? is governed by Euclidean Geometry. One of the main 
goals of differential geometry is to develop analogous techniques for curved 
geometries as well. 


Other Examples of Curves 


While curves described parametrically may be defined at the whim of the 
definer, in fact they often arise from geometric or physical considerations. 
Several of the examples below illustrate this. 


Example 1.2: The Circle of radius r (centered at (0,0)). 

Of course, the question is, how can such a curve be parametrized? The defini- 
tions of sin@ and cos @ as the vertical and horizontal sides of the right triangle 
with angle @ depicted below give us a way of assigning coordinates to the 
point P. 


FIGURE 1.6. Circle of radius r 


Namely, the point P has coordinates (cos, sin@). For a circle of radius r, 
by the fundamental property of similar triangles, the vertical and horizontal 
sides of the triangle must be rsin@ and rcos@ respectively. Hence, the coor- 
dinates of the point on the circle are (rcos6, rsin@). Therefore, the circle 
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may be parametrized by the angle 0. Again thinking of the circle as the path 
of a particle, we write 

a(t) = (rcost, rsint), for 0 <t < 2n. 


There are two important observations to remember. First, a/(t) = (—rsint, 
rcost) so a(t) -a‘(t) = 0 for all t. In other words, the tangent vector is 
perpendicular to the radius. Second, a’’(t) = (—rcost, —rsint) = —a(t), so 
the acceleration vector points toward the center of the circle. Finally, we can 
compute the arclength (i.e. circumference) of the circle, 


Qn 
La) = [ |a'(t)|dt 
0 
2r 
= Vr2 sin? t + r2 cos? t dt 
0 
Qn 
= rv sin? t + cos? t dt 
0 


2n 
=| r dt 
0 


EXERCISE 1.5. Suppose a circle of radius a sits on the z-axis making contact at 
(0,0). Let the circle roll along the positive x-axis. Show that the path a followed 
by the point originally in contact with the z-axis is given by 


a(t) = (a(t —sint), a(1 —cost)) 


where t is the angle formed by the (new) point of contact with the axis, the center 
and the original point of contact. This curve is called a cycloid and we will meet it 
again in Chapter 8. Hints: Recall that s = at where s is arclength. Draw a picture 
after the circle has rolled s units. Where is the old point of contact? 


FIGURE 1.7. A cycloid 
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EXERCISE 1.6. Consider a cycloid of the form 
(x(t), y(t)) = (A+a(t—sint), B—a(1—cost)). 


Graph this cycloid to see that it is an inverted form of the one found in the previous 
exercise. Suppose a unit mass particle starts at rest at a point (Z,%) on the cycloid 
corresponding to an angle ¢ in the parametrization above. Under the influence of 
gravity (and assuming no friction), show that, no matter what initial ¢ is chosen, it 
always takes a time of 


[a ; tit 
f 3 T where g is the gravitational constant 


for the particle to slide down to the bottom of the cycloid (i.e. t = 7). This property 
was used by C. Huygens to make clocks without pendula (which he hoped would 
allow for accurate timekeeping on ships at sea and, thereby, improve navigation). 
He named the cycloid the tautochrone since the latter means “same time.” We 
will revisit the cycloid in Chapter 8 where we will see it has yet another intriguing 
property and name. Hints: potential energy is turned into kinetic energy by, vy? {2 = 
9(y¥ — y). Time equals distance divided by speed, so 


Zoot 1+ s2 
r=— | a ee eee 
V29 Ja V¥-Yy 


us — 
=/3 / 1 — cost dt 
9 Jt cost — cost 


a\" ee V2 cos § 
=,/- | —2arcsin | ———— 
9 lt V1+ cost 


using y’ = (dy/dt) /(dx/dt) = —sint/(1—cost), ¥ = B—a(1—cost), y = B—a(1— 
cost) and simplifying. Verify the final step by differentiation and compute T’. 


Example 1.3: The Astroid. a(t) = (acos*t, asin? t) for 0 < t < 2r. 
The definition of the astroid (which was discovered by people searching for 
the best form of gear teeth) is very similar to that of the cycloid. For the 
astroid, however, a circle is rolled, not on a line, but inside another circle. 
More precisely, let a circle of radius a/4 roll inside a large circle of radius a 
(centered at (0,0) say). For concreteness, suppose we start the little circle at 
(a,0) and follow the path of the point originally in contact with (a,0) as the 
circle rolls up. Roll the circle a little bit and notice that a piece of arclength 
s is used up. The key to understanding this situation is the fact that, for f, 
the angle from the center of the large circle to the new contact point and 0, 
the angle through which the small circle has rolled (measured from the point 
of contact), we have s = at and s = a@/4. Hence, 0 = 4t. 

Now we can do two things to parametrize the path of the rolling point. 
First, we can parametrize the path with respect to the center of the little 
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FIGURE 1.8. An astroid 


circle. This is easy because we are just moving an angle @ around a circle. 
There is a small problem because we measure 6 from the point of contact, 
but this is handled easily as in Figure 1.9. Note that we make real use of 
the relation 0 = 4t here. The coordinates with respect to the center then 
become x = a cos3t/4 and y = a sin3t/4. Secondly, the center of the little 
circle always remains on the circle of radius 3a/4 centered at (0,0), so that 
the center has moved to (3a cost/4, 3a sint/4). Hence, with respect to the 
origin (0,0), the rolling point has moved to 


3a a 3a, a. 
a(t) = (3 cost + i cos 3t, ze sint — Z sin3t] ; 


FIGURE 1.9 


EXERCISE 1.7. Verify the statements above and show that the formula for the 
astroid may be reduced to 


a(t) = (a cos’ t, a sin? t) 


with implicit form o2/3 4. y?! 3 = 92/3, 
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EXERCISE 1.8. From the origin draw a line through any point P of the circle of 
radius a centered at (0,a). Find the intersection Q of this line with the horizontal 
line y = 2a. Drop a vertical line from Q and intersect it with a horizontal line 
passing through P. This intersection is a point on the curve known as the witch of 
Agnesi*. Let t denote the between the vertical axis and the line through (0,0), P 
and Q. Show that the witch W(t) is given in terms of t by 


W(t) = (20 tant, 2a cos” t) : 


—4 —2 


FIGURE 1.10. The Witch of Agnesi 


Example 1.4: The Helix. a(t) = (acost, asint, bt), for 0 <t < oo. 
Notice that the the first two coordinates provide circular motion while the 
third coordinate lifts the curve out of the plane. We calculate a’(t) = (—asint, 
acost, b) and a(t) = (—acost, —asint, 0). Note that a’(t) points toward 
the z-axis. 


FIGURE 1.11. A helix 


*The “witch” was named by Maria Agnesi in the middle 1700’s — apparently as the 
result of a mistranslation! The curve had been studied previously by Grandi who had used 
the Italian word “versorio,” meaning “free to move in any direction,” in connection with it. 
Agnesi mistakenly thought Grandi had used the Italian “versiera”, which means “Devil’s 
wife” or “witch” and so Agnesi’s witch got “her” name [Ya]. 
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FIGURE 1.12. Diagram of a suspension bridge 


Example 1.5: The Suspension Bridge. 

Consider the diagram of a suspension bridge (Figure 1.12) where the cable 
supports a uniformly distributed load. This means that, over any interval 
(a, b], the weight supported is given by W = c(b— a), where c is constant. 
Newton’s Law F' = ma applies to the vertical and horizontal components of 
tension in the cable T to give force equations 


T sind =cx and T cos? = Tp 


where x and @ are as depicted and To is the horizontal force of tension pulling 
on the length of cable from 0 to z. 


EXERCISE 1.9. Using dy/dx = tan@, solve these equations to get 


Thus, the cable of a suspension bridge hangs as a parabola. 


Example 1.6: The Catenary. . 
Now suppose a cable hangs freely only supporting its own weight (Figure 
1.13). What is the curve it follows? The key difference between the preceding 
example and this is that the weight is not uniformly distributed horizontally, 
but, rather, is uniformly distributed along the length of the cable. Therefore, 
we must take arclength into account as opposed to the simpler distance along 
the axis. Let W denote the (weight) density of the cable and let s = s(z) 
denote the arclength of the cable from x = 0. 


EXERCISE 1.10. From the diagram, Newton’s Law F' = ma, dy/dz = tan@ and the 
arclength derivative ds/dz = J 1+ (y’)*, derive the following equations 


dy W 
ie To and —F=—1/1+—. 
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Ae 


FIGURE 1.13. Diagram of a hanging cable 


Now solve the second equation to get 


x 


y = C cosh (7 


)+D. 


Hints: to solve the differential equation, let z = dy/dz. The equation becomes 
separable 


oe. = La 
V1+22 To 


and the lefthand side may be integrated by substituting in z = sinh u and recalling 
cosh? u — sinh? u = 1. 


A curve of the form y = c cosh(z/c) is called a catenary from the Latin for 
“chain.” We will meet this curve again and again in the rest of the book. 


Example 1.7: The Pursuit Curve. 

Suppose an enemy plane begins at (0,0) and travels up.the y-axis at constant 
speed vp. A missile is fired at (a,0) with speed v,, and the missile has a 
heat sensor which always directs it toward the plane. Show that the pursuit 
curve which the missile follows is given implicitly by the differential-integral 


equation 
y=ary +2 [vitae 
Vm 


Differentiate this expression to get a separable differential equation. Integrate 
to get the closed form expression for the pursuit curve 


Pp. — Ze: 
av Up V, vp a Ym Vp V Vp 
= poe Mae gi Um — Pg) tm — git Ym + Pg it Um 
to 2(1+ 22) 
( ~ rane Um ( Um Vm 
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Example 1.8: The Mystery Curve. a(t) = (4; cost, sint, a COS t) for 
O<t< 2dr. 


EXERCISE 1.11. Identify this curve. Find a’(t), a(t), and L(a). Hints: what kind 
of curve is this? Where does it lie in R?? 


EXERCISE 1.12. Parametrize a circle which is centered at the point (a, b). 


EXERCISE 1.13. Parametrize the ellipse 7 + ue a Bs 


1.2 ARCLENGTH PARAMETRIZATION 


So far, we have thought of the parameter ¢ as being time. More often it 
is convenient to let the parameter express how far along the curve we are. 
That is, when we now write a(s), the parameter s is exactly the distance we 
have travelled along the curve. This is called parametrization by arclength. 
As we will see, arclength parametrization simplifies the differential geometry 
of curves considerably. Therefore, it would be nice to know that every curve 
may be parametrized in this way. This is, in fact, the case, although it is 
only “in principle” that this process works in general. By this we mean to 
say that, while any curve may be parametrized by arclength, it is quite rare 
to obtain an explicit formula for the arclength parametrization. With this in 
mind, we will now first consider the notion of parametrization a little more 
carefully, then prove that every curve has an arclength parametrization. In 
the next section we will use the arclength parametrization to derive the Frenet 
Formulas, the formulas which describe the geometry of curves. 

Given a curve a: J — R?® with parameter t, we may reparametrize the 
curve by mapping another interval onto J and using the composition as a 
“new” curve (which, of course, has the same point-set image in R*). Precisely, 
let h: J ~ I be a map of the interval J onto J. Then a reparametrization of 
@ is given by 

B=aoch: JR’, B(s) = a(h(s)). 


The curves 3 and a pass through the same points in R®, but they reach any 
one of these points in different “times” (s and t). The velocity vectors of a 
and @ are altered by h. 


Lemma 2.1. (’(s) = a’(h(s)) - 44(s). 


Proof. By definition, 6(s) = (a*(h(s)), a?(h(s)), o°(h(s)). The derivative of 


the coordinate function {3’ is, 


a ye da'(h(s)) _ da’ dh 
ds dh ds 
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by the chain rule. But da*/dh is exactly da‘/dt where h(s) = t € I, so we 
obtain for t = h(s), 


dot dh da? dh da? dh dh 
Ug yes | a a. 
Bs) (F ds’ dt ds’ dt a) aa 


O 


EXERCISE 2.1. Recall that the arclength of a curve a: [a,b] > R® is given by 
L(a) = f |o’(t)|dt. Let G(s): [c,d] > R®° be a reparametrization of a defined by 
taking a map h: [c,d] — [a,b]. Show that the arclength does not change under 
reparametrization. 


A curve a(t) is regular if a/(t) # 0 for all t € J. In general, if a curve is 
not regular, then we split it into regular pieces and consider each of these 
separately. 


Example 2.1: The Cusp a(t) = (¢?, t°). 

Note that « = t?, y = t?, so y = +2°/? with a/(t) = (2t,3t?) and a’(0) = 
(0,0). Thus a@ is not regular, but, the pieces above and below the x-axis are 
regular. 


Theorem 2.2. Ifa is a regular curve, then a may be reparametrized to have 
unit speed. 


Proof. Define the “arclength function” to be 


s(t) = f lalla 


(where u is just a “dummy” variable). Since a is regular, the Fundamental 
Theorem of Calculus implies 


By The Mean Value Theorem, s is strictly increasing on I and so is one-to- 
one. Therefore, s has an inverse function which we denote by t(s) and the 
respective derivatives are inversely related, 


dt 1 
Fra = Bi(s)) > 0. 
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Let G(s) = a(t(s)). Then, using the Lemma, (’(s) = a’(t(s))#(s). But now 
we have, 


Oa COMPO) 


ds dt 
a 5 tls) aa) 


ds 1 
= q tls) Bes) 


=], 
E 


Without loss of generality, suppose @ is defined on the interval [0,1]. Consider 
the arclength of the reparametrization 3 out to a certain parameter value so, 


L(s0) =f [e'(a)ias= f° 14s = oo 


This is, of course, exactly what we mean by “arclength parametrization.” 
Hence, a curve is parametrized by arclength exactly when it has unit speed. 


Example 2.2: The Helix. 

Consider the helix a(t) = (acost,asint, bt) with a/(t) = (—asint,acost, b). 
We have |a’(t)| = Va?+6? =c. Then s(t) = fo cat = ct and the inverse 
function is t(s) = 15, So an arclength reparametrization is given by 


G(s) = a(s/c) = (acos =,asin : *), 


Note that |3’(s)| = 1. This example generalizes to the case of any a with 
constant speed. Such a curve may be parametrized by arclength explicitly. 
For general curves, however, the integral defining s may be impossible to 
compute in closed form. 


EXERCISE 2.2. Reparametrize the circle of radius r to have unit speed. 
EXERCISE 2.3. Can you explicitly reparametrize the ellipse to have unit speed? 


EXERCISE 2.4. Let a(t) be a curve not necessarily parametrized by arclength. 
Suppose a string of length C is wound around the curve a and is attached to the 
curve at s(tc) = C, where s is the arclength function along a. If the string is kept 
taut and unwound from the curve, the free end always lies a distance s(t) away from 
the point a(t) in the direction —a’(t). Thus, show that the curve swept out by the 
free end is given by 


a(t) 


T(t) = alt) — s(t) rey 
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This curve is an involute of a. Calculate and graph the involute of the unit circle 
a(t) = (cost, sint, 0). Note that the involute of a unit speed curve a(s) is given by 
T(s) = a(s) — sa‘(s). For an application of circle involutes to gear teeth design see 


[SU]. 


EXERCISE 2.5. Show that the involute of a helix is a plane curve. 


1.3 FRENET FORMULAS 


In this section we shall assume our curves are parametrized by arclength. 
Consider this problem. Suppose a robot arm with three fingers at its end is to 
move and grasp an object. How should a path be programmed for the “hand” 
so that the fingers end up in the proper “grasping” position? Somehow, a 
curve (which may not be a line, of course, since various obstructions have to 
be avoided) must be parametrized to align the fingers correctly. We will see 
that the geometry of a curve (i.e. its twisting and turning) may be completely 
described by understanding how three “fingers” attached to the curve vary as 
we move along the curve. The variation is described by what are now called 
the Frenet Formulas. What are these “fingers” which are attached to the 
curve? 

The first finger is the tangent vector to the curve. Let @: I + R® be a unit 
speed curve (i.e. |G’| = 1). Denote the unit tangent vector along 6 by T = £’. 
Because |T'| = 1 everywhere along 3, the derivative T’ measures only the rate 
of change of T’s direction. Hence, T’ is a good choice to detect some of (’s 
geometry. 


Trick to Remember 3.1. |T| = VT-T = 1, soT-T = 1 as well. The 
product rule then gives 


0S (7 TY aT -T4T+T =2T -T". 
Hence T - T’ = 0 and, therefore, T’ is perpendicular to T. 


105005 145 95 4 


FIGURE 1.14. Curve with T and T’ 
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We say that T’ is normal to @. Because there is no reason why the length of 
T’ should be 1, however, we define the curvature function of 2 to be 


K(s) = |T"(s)]. 


Of course & > 0 and « increases as @ turns more sharply. If « = 0, then, 
as we will see in Theorem 3.2 below, we know everything about the curve @ 
already. Therefore, for the discussion below, assume k > 0. 


EXERCISE 3.1. Suppose @ is a curve in the ry-plane. Show that, up to sign, the 
curvature of @ is given by q8 where @ is the angle between T and e; = (1,0). 


Since we would like our fundamental quantities to be unit vectors, we define 
the principal normal vector along G to be N = T’ . Note that for this to 
make sense we require k > 0. Then, by the definition of curvature, |N| = 1. 
We need to have a third vector along @ as part of our 3-fingered orientation in 
3-dimensional space and this third vector should be perpendicular to both T 
and N (just as T and N are to each other). Before we make the definition, we 
recall some facts about another vector operation, the cross product. Unlike 
the dot product of two vectors (which produces a real number), the cross 
product of two vectors produces another vector. Let v = (v',v?,v?) and 
w = (w!,w?,w?). Then we define the cross product of v and w to be (in 


coordinates), 


v x w= (vw? — v3 w?, vw! — v3, vw? — vw"). 


An easy way to remember this formula is to compute the “determinant” 


where e€;, €2, e€3 denote the unit vectors (1,0,0), (0,1,0) and (0,0,1) re- 
spectively. Note that the traditional notation of vector calculus denotes these 
vectors by 7, 7 and k. 


4 


EXERCISE 3.2. Compute the determinant and verify that it gives the formula for 


7 


VX w. 


EXERCISE 3.3. What happens if the 2"¢ and 3° rows of the determinant are 
interchanged? 


EXERCISE 3.4. Verify (by tedious computation) the following formula known as 
Lagrange’s Identity. 
Iv x wl? = |v/?lwl? — (v-w)?. 
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EXERCISE 3.5. Use the previous exercise to show that 
lv x w| = |v| [wi sin8 


where @ is the angle between v and w. Note then that v x w = 0 if and only if v 
and w are parallel (i.c. w = av fora €é R. 


EXERCISE 3.6. Show that v-(v x w) = 0 = w:(v x w). Therefore, the cross 
product of two vectors is a vector perpendicular to both of them. 


EXERCISE 3.7. Show that |v x w| is the area of the parallelogram spanned by v 
and w (as shown in Figure 1.15). 


FIGURE 1.15 


Now, knowing what we do about the cross product (and always assuming 
« > 0), we may define the binormal along G to be B= T x N. Note that 
{B| = |T| |N|sin90° = 1-1-1 =1, so B is already a unit vector. Also, as we 
have seen above, B-T =0= B-N. The set {T, N, B} is called the Frenet 
Frame along 3. The measurement of how T, N, and B vary as we move along 
G will tell us how @ itself twists and turns through space. 


1 B 


mr 305 aoc arg og. <A 


FIGURE 1.16. Frenet frame along a curve 
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The variation of T, N, and B will be determined by calculating the derivatives 
T’, N’, B’. We already know T’ = kN by definition of N, so the curvature 
« describes T’s variation in direction. We must still find B’ and N’. Now, T, 
N and B are three mutually perpendicular unit (i.e. orthonormal) vectors in 
the 3-dimensional space R, so any vector in R? is some linear combination of 
them. In particular, B’ = aT +bN-+cB. If we can identify a, b and c, then 
we will know B’. To do this we use what we know about T, N and B. First, 
T-B’=aT:T+bT-N+cT-B 
=a-1+6-0+c-0 


4: 
Similarly, N - B‘=b and B- B’ =c. Therefore, 
B! =(T- B')T+(N-B')N+(B-B’)B. 


Now let’s identify T’-B’. We know T-B = 0, so that 0 = (T-B)' = T’-B+T-B’ 
by the product rule. Then, using N - B = 0, we obtain 


We can also identify B-B’. We know B-B = 1, s00 = (B-B)' = B’-B+B-B' = 
2B: 3B’. Thus, B- B’ = 0. Now we are left with a single possibly nonzero 
coefficient in the expression for B’. Since we can’t immediately identify it in 
terms of known quantities, we give it a name. Define 7 = —N- B’ to be the 
torsion of the curve G. By what we have said above, 


Bi =—-TN. 
Now we find N’. Just as for B’, we have 
N'=(T-N')T+(N-N')N+(B-N’)B. 


The same types of calculations give T: N = 0,so0 =7T’-N+T-WN’ and 
T’ =«N soT:-N' =-KkN-.N=-k. Also, N-N =1,s0 N-N’ =0 and 
B:N=0, so B’-N+B-N'=0. Hence, B-N’ = —-B'.N=-N-B' =r 
by|definition. Thus, 

N=-«T+TB. 


Theorem 3.1 (The Frenet Formulas). For a unit speed curve with « > 0, 
the derivatives of the Frenet frame are given by 


T’= KN 
N'=-«T+7TB 
B'=-rN. 
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Example 3.2: The Circle. 

Consider a circle a(t) = (rcos£,rsin£,0). The derivative of a is a’(t) = 
1/r(—rsint/r,rcost/r,0) = (—sint/r,cost/r,0) with |a’(t)| = 1, so a has 
unit speed and T = a’. Now T’ = a” = (-1/rcost/r,—1/rsint/r,0) = 
—1/r(cost/r,sint/r,0) where (— cost/r, —sint/r,0) is a unit vector and thus 
equal to N. Now, T’ = +1/rN, so « = 1/r. That is, for a circle of radius 
r, the curvature is constant and is equal to 1/r. This makes sense intuitively 
since, as r increases, the circle becomes less curved. The limit « = 1/r — 0 
reflects this. Now, we wish to understand the torsion of a circle. 


t 
N= Cae. ee 0) 
re le r 
1 
=--T 
m 
= —KT. 
But the Frenet Formulas say N’ = —«KT +78, so we must have 7 = 0. 


Therefore, the circle has zero torsion. We shall see a general reason for this 
fact shortly. 


EXERCISE 3.8. The Frenet Formulas help us to identify curves even when they are 
given in rather complicated parametrized forms. The curve below will also appear 


in Exercise 5.3. Let G(s) = (a LD sg +s) for -l<s<1l. 
(1) Show that @ has unit speed. 
(2) Show that « = 1/,/8(1 — s?). 
(3) Show that N = CLD y EICEDY 0) and 
B=TxN = (=¥it8, Wize v2), 


(4) Show that 7 = x. 


EXERCISE 3.9. If a rigid body moves along a curve a(s) (which we suppose is unit 
speed), then the motion of the body consists of translation along a and rotation 
about a. The rotation is determined by an angular velocity vector w which satisfies 
T’ =wxT, N' =wxWN and B! = wx B. The vector w is called the Darbouz 
vector. Show that w, in terms of T, N and B, is given by w = 7TT+4B. Hint: 
write w = aT +b6N+cB and take cross products with T, N and B to determine 
a, b and c. 


EXERCISE 3.10. Show that T’ x T” = x2w where w is the Darboux vector. 


EXERCISE 3.11. Instead of taking the Frenet frame along a (unit speed) curve 
a: [a,b] + IR?, we can define a frame {T,U, V} by taking T to be the tangent 
vector of a as usual and letting U be any unit vector field along a with T-U = 0. 
That is, U: [a,b] + R® associates a unit vector U(t) to each t € [a,b] which is 
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perpendicular T(t). Define V = T x U. Show that the natural equations (i.e. 
“Frenet Formulas”) for this frame are 


T’ =w3U —weV 
U! = -w3T +01V 
V! =woT —w,U 


where w 1, w2 and w3 are coefficient functions. Furthermore, show that the Darboux 
vector w satisfying T’ = w x T, U'’ = wx U and V’ = w x V is given by w = 
wy T +02U +03V. 


EXERCISE 3.12 [Sco]. A unit speed curve of constant precession is defined by the 
property that its (Frenet) Darboux vector w = rT. +B revolves about a fixed 
line in space with constant angle and constant speed. Show, by following the steps 
below, that a curve of constant precession is characterized by having 


k(s) = asin(bs) and T(s) = acos(bs) 


where a > 0 and 6 are constants and c = Va? + b?. First, show that the following 
five properties are equivalent for a vector A = w+0WN and a line @ parallel to A(0): 
(i) |w| = a (ii) cos@ = a/c, where @ is the angle between w and A (iii) |N’| = a 
(iv) cos(7/2 — 6) = b/c, where 1/2 — @ is the angle between A and N (v) [Al =c. 
Second, given any of the properties just listed, show that A is always parallel to @ 
(ie. A(0)) if and only if w’ = —b N’. Finally, show that A’ = 0 if and only if 


rv =bk and ki = —br. 


Solve this system of differential equations to get the result. MAPLE’s “dsolve” com- 
mand may be useful. An explicit parametrization for curves of constant precession 
is given in the MAPLE section at the end of this chapter. 


EXERCISE 3.13. Let a and ( be two closed curves (i.e. a(a) = a(b) and similarly 
for @) which do not intersect. The linking number of a and G, Lk(a, 3), is defined 
by projecting R? onto a plane so that at most two points of a and £ are mapped 
to a single image. Then, assigning +1 to every a undercrossing according to the 
orientations shown below and summing the +1’s gives Lk(a, (3). 


+1 = 


FIGURE 1.17. Linking number orientations 
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Let {T,U,V} be a frame as in Exercise 3.11. The twist of U about a is defined to 


be 
b 


Tw(a,U) = =f wy at. 


a 


Let @ = a+e€U, where € is small enough so that a and G do not intersect. Show 
that, if a is a plane curve, then Lk(a,3) = Tw(a,U). This is a special case of 
White’s formula which has proved important to modern DNA research (see [Poh], 
[PoR] for instance). 


EXERCISE 3.14. For the Frenet frame {T, N, B}, show that the twist of the principal 
normal N about a is (up to a multiple of 1/27) simply the total torsion of a. That 


is, show 
b 


Tw(a,N)= ral dt. 


a 


Example 3.3: The Helix. 

Consider the helix G(s) = (acos#, asin£, %) with c = (a? + b*)!/?. Now 
B'(s) = (-a/csins/c, a/ccoss/c, b/c) with |@’(s)| = a*/c? +b? /c? = c?/c? = 
1, so @ has unit speed. Thus T = 6’(s) and T’ = B”’(s) = (—a/c* coss/c, 
—a/c* sins/c, 0) = a/c?(—coss/c, —sins/c, 0) =a/c?N. Thus 


a a 


Grae): 


a constant. 
EXERCISE 3.15. Show that 7 = b/c? = b/(a” + b”), a constant. 


Constraints on curvature and torsion produce concomitant constraints on the 
geometry of a curve. The simplest constraints are contained in the following 


Theorem 3.2. Let G(s) be a unit speed curve. Then, 


(1) « = 0 if and only if @ is a line. 
(2) fork > 0, tT = 0 if and only if G is a plane curve. 


Proof. 

(1) Suppose @ is a line. Then @ may be parametrized by G(s) = p+ sv with 
|v| = 1 (so @ has unit speed). Then T = 6’(s) = v constant, so T’ =0=KN 
and, consequently, « = 0. 

Suppose k = 0. Then T’ = 0 by the Frenet Formulas, so T = v is a 
constant (with |v| = 1 since @ has unit speed). But 6’(s) = T = v implies 
G(s) =p+t sv with p a constant of integration. Hence, @ is a line. 

(2) Suppose r = 0. Then, by the Frenet Formulas, B’ = 0, so B is a constant. 
But this means ((s) should always lie in the plane perpendicular to B. We 
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(0) 


FIGURE 1.18. Plane through 6(0) normal to B 


show this. Take the plane determined by the point G(0) and the normal vector 
B. Recall that a point P is in this plane if (P — G(0))-B=0. 

We will show that, for all s, (G(s) — 8(0))-B = 0. Consider, ((G(s) — (0))- 
B)’ = (6(s) — B(0))'- B + (B(s) — B(0))- BY = B'(s)- B=T- B=0. Hence, 
(G(s) — G(0))- B = constant. To identify this constant, evaluate the expression 
at s = 0. We get (3(0) —G(0))-B = 0. Then for all s, (G(s) —@(0))-B = 0 and, 
hence, 3(s) is in the plane determined by ((0) and the constant vector B. 

Suppose @ lies in a plane. Then (as we noted above) the plane is determined 
by a point P and a normal vector n # 0. Since # lies in the plane, 


(G(s) -—P):-n=0 forall s. 


By differentiating, we obtain two equations: ’(s)-n = 0 and B"(s)-n = 0. 
That is, T-n =0 and « N-n = 0. These equations say that n is perpendicular 
to both T and N. Thus n is a multiple of B and 


+n/|n| = B. 
Hence B is constant and B’ = 0. The Frenet Formulas then giver =0. U 


So now we see that curvature measures the deviation of a curve from being 
a line and torsion the deviation of a curve from being contained in a plane. 
We know that the standard circle of radius r in the zy plane has 7 = 0 and 
kK =1/r. To see that a circle located anywhere in IR? has these properties we 
have two choices. We could give a parametrization for an arbitrary circle in 
R®? or we could use the direct definition of a circle as the collection of points 
in a plane which are a fixed distance from a given one. In order to emphasize 
geometry over analysis (for once), we take the latter approach. 


Theorem 3.3. A curve ((s) is part of a circle if and only if k > 0 is constant 
and T = 0. 


Proof. Suppose 2 is part of a circle. By definition, G is a plane curve, so 
tT = 0. Also by definition, for all s, |G(s) — p| = r. Squaring both sides gives 
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(G(s) — p) - (G(s) — p) = r?. If we differentiate this expression, we get (for 
T =f’) 
2T -(G(s)-p)=0 or T- (G(s) —p) =0. 
If we differentiate again, then we obtain 
T’ -(G(s)-—p)+T-T=0 
(*) KN -(G(s)-p)+1=0 
KN - (G(s) —p) =—1. 

This means, in particular, that « > 0 and N - (G(s) — p) #0. Now differenti- 
ating (*) produces 

dk 


aN: (G(s) — p) +N’ - (G(s) -p) +nN-T=0 


& N- (G(s) ~p) +5(-KT + 7B) -(B(s) ~p) +0=0. 


Since T = 0 and T - (G(s) — p) = 0 by above, we have 
dk, 
= N - (G(s) ~ p) =0. 


Also, N - (G(s) — p) # 0 by above, so ds = 0. This means, of course, that 
«& > 0 is constant. 

Suppose now that 7 = 0 and & > 0 is constant. To show ((s) is part of a 
circle we must show that each ((s) is a fixed distance from a fixed point. For 
the standard circle, from any point on the circle to the center we proceed in the 
normal direction a distance equal to the radius. That is, we go rN =1/kK N. 
We do the same here. Let 7 denote the curve 


1 
1(s) = B(s) +N. 


Since we want 7 to be a single point, the center of the desired circle, we must 
have 7'(s) = 0. Computing, we obtain 


1 
7'(s) = B'(s) + . N' since « is a constant 


1 
=T+ (oaE + 7B) 
il Rn & since T = 0 
=; 


Hence 7+(s) is a constant p. Then we have 
1 1 
\8(s) - pl =|- N= =, 


so p is the center of a circle G(s) of radius 1/k. i) 
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EXERCISE 3.16. Show that, if every tangent line of a curve @ passes through a 
fixed point p, then @ is a line. Hint: p = @(s) + r(s)@’(s) for some function r(s). 


EXERCISE 3.17. Let a be a unit-speed curve which lies on a sphere of center p 
and radius R. Show that, if 7 # 0, then a(s) — p = -iNn - (4)'2B and R? = 


ye (Hay, 


EXERCISE 3.18. Show that, if (4)/ # 0 and (4)? + ((4)'2)? is a constant, then a 
(unit speed) curve a@ lies on a sphere. 


EXERCISE 3.19. Compute the curvature « and torsion 7 of the curve 


a(t) = (a cos*(t), a sin(t) cos(t), a sin(t)). 
}> 
Then, for a = 1, graph a(t) and the sphere of radius one centered at the origin 
on the same set of axes. Also, show that the curve a(t) is a spherical curve by 


computing 5 ; 
(5) +(2(6) wen) 


to be a constant (which must be the squared radius of the sphere on which a lies). 
Note that the extra |a’(t)| = ds/dt is necessary since a is not unit speed and the 
derivative of 1/« here is with respect to t whereas the derivative of 1/x above was 
with respect to arclength s. Hint: use MAPLE. 


EXERCISE 3.20. Find a parametrization for the curve obtained by intersecting the 
following sphere and cylinder: 


f 
a? + y? +2? = 4a? (x — a)? +y* =a’. 


Hint: xz = acost + a, y = asint. work in the cylinder. Plug these z and y into the 
sphere’s defining equation and solve for z. This curve is known as Viviani’s curve. 
Compute the curvature and torsion of Viviani’s curve and (since we know the curve 
is spherical!) verify that the formula 


a” 4 
are es a ee eee 
K T\K/ |o(t)| J 
holds. Finally, graph Viviani’s curve on the sphere. Compare Viviani’s curve with 
the curve of Exercise 3.19 and make a conjecture. Can you prove your conjecture? 


EXERCISE 3.21. Find « and 7 for the curve G(s) = (4 cos(s), sin(s), 
A cos(s)). Identify the curve. 
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EXERCISE 8.22. Let a be a unit-speed curve. Show that there is a unique circle @ 
with 
B0)= (0)  B(0) =a"(0)_—B"(0) = a"(0). 


Hint. Start with G(s) = p+ Rcos(#)vi + Rsin()v2 where v; and v2 are or- 
thonormal vectors. Your proof will show that @ lies in the plane spanned by T and 
N. This is the osculating plane of a and £3 is called the osculating circle of a. Your 
proof will also show that the radius of @ is exactly 1/Kq(0). In this sense ( is the 
circle of closest fit to a at a(0). Here are two things to think about. Does the 
osculating circle always lie on one side of the curve? How could you use the idea of 
the osculating circle and a computer to calculate curvature at points of a curve? 


1.4 NONUNIT SPEED CURVES 


Although all regular curves have theoretical unit speed parametrizations, 
some (such as the ellipse) do not admit explicit ones. Therefore, in order to 
understand the geometry of these curves we must obtain modifications of the 
Frenet Formulas which account for nonconstant speed. 

Let a(t) be an arbitrary curve with speed v = |a’(t)| = ds/dt. Theoretically 
we may reparametrize a to get a unit speed curve @(s(t)), so, we define 
curvature and torsion of a in terms of its arclength reparametrization a(s(t)). 
Moreover, the tangent vector a’(t) is in the direction of the unit tangent 
T(s) of the reparametrization, so T(s(t)) = a’(t)/|a’(t)|. This says that we 
should define a nonunit speed curve’s invariants in terms of its unit speed 
reparametrization’s invariants. 


Definition 4.1. 
(1) The unit tangent of a(t) is defined to be T(t) wT T(s(t)). 
(2) The curvature of a(t) is defined to be «(t) os R(s(t)). 
(3) If « > 0, then the principal normal of a(t) is defined to be N(t) 
N(s(t)). 
(4) If « > 0, then the binormal of a(t) is defined to be pe ) el B(s(t)). 
(5) If « > 0, then the torsion of a(t) is defined to r(t) = = ' +(s(t)). 


Theorem 4.1 (The Frenet Formulas for Nonunit Speed Curves). 
For a regular curve a with speed v = ds/dt and curvature k > 0, 


T’ = «vN 
N'=-«KvT+7vB 
B'=-ryN 


Proof. The unit tangent T(t) is T(s) by definition. Now T’(t) denotes differ- 
entiation with respect to t, so we must use the chain rule on the righthand 
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side to determine « and rT. 


= K(s) N(s)v 
= K(t) N(t)v(t) by definition 
T’ = Kv N, 


so the first formula is proved. For the second and third, 


dN(s) ds 
' = eae 
NS ds_ dt 


= (-KT +7 B)p 


by the unit speed Frenet formulas, 
=—-KVT+7v B. 


and 
dB(s) ds 
! = <n 
BQ) ds_ at 
= —F Nv 


= —TV N. 


Lemma 4.2. a’ = vT anda” = ae T+Kv7QN. 


Proof. Since a(t) = G@(s(t)), the first calculation is, 


a’ (t) = a (s)— 
= vT(s) 
= v(t) T(t) 


while the second is 


a(t) = a T(t) + v(t) T’(t) 


dv 
Tae T(t) + v(t)K(t)v(t) N(t) 


dv 2 
errs T(t) + K(t)v*(t) N(t). 
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EXERCISE 4.1. Work out the following precisely. The formula for a” allows an 
analysis of the following physical situation. Suppose a car is going around a level 
highway curve at constant speed (only for convenience). Then the centripetal accel- 
eration of the car is a’ = xv” N. The only force acting on the car is the frictional 
force required to keep the car on the road. As long as the tires roll and do not skid, 
this force is given by a coefficient of friction p (typically ~ .60) multiplied by the 
weight (mass time gravitational constant) mg of the car. By Newton’s Law, to keep 
the car on the road we require 


pmg = m«v* or Lug > Kv”. 
For a road of given curvature k, to stay on the road, speed must satisfy 


v< Po 
K 


What changes if the road is banked? 
From the definitions above and what we know about unit speed curves, for 


a nonunit speed curve a, we have T = a’/|a’|,B=TxNandN=BxT. 
We also have the 


Theorem 4.3. For any regular curve a, the following formulas hold. 


a’ xq" 
1) B= —_——_ 
( ) la’ x al|’ 
la’ x a" | 
(2) K= Jae” 
(3) a (a! x a’) Ql! 


la’ x a" |2 


Proof. For (1), we use the formulas of the Lemma to get 
a’ x al’ = (vT) x (ST + nv? N) 


=vZTxT +P x N 


=0+x«y°B 


/ x i 
Hence, |a’ x a”| = kv3, so B = st 
la’ x a” 


For (2), we use the expression for a” in the Lemma, take cross product 
with T = a’/|a’| and note that T x T = 0 to isolate the curvature, 


Txoa’!=0+Kv7TxN 


a’ x al” 
la’| 


=Kv*T x N, 
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so taking lengths and using |B| = 1 produces 


/ x Ld 
Ieee) 2B 
la’ x a” | 
cam aes = RK. 


For (3), we take the third derivative 


mi dv 7 2 ; 
a = a +Ky*N 


dy dv_, dk 
= aa rt He aes 


Therefore, since T’ = « N and B is perpendicular to T and N, we get 


B.a"” =Kv*B-N' 


= kv? B- (—Kv T + rv B) 


= KTV? 


since B-T =0 and B-B=1. Nowa’ x a” = Kv B, so 


(a’ x a’) F al” _ kv? B . al” 


= Kv? (KTv*) 


>= K7V8 7, 


Of course, |a’ x a’’|? = «2v*, so we have 


= (a! x a’) Ql! _ (a! x a”) al! 
K2p6 = la’ x a" |2 . 


EXERCISE 4.2. For the ellipse a(t) = (acost, bsint), show that x 


v?>=N + nave N+x«v? N’. 
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0 


ab/(a* sin? t + b? cos? t)3/ 2 Note the case of the circle a = 6. Show that a for- 


mula for the curvature of any plane curve a(t) = (x(t), y(t)) is given by 


foes |ar’ 9” ~ xy’ | 
sd (x!? + y!?)3/2" 


EXERCISE 4.3. Compute T, N, B and the curvature and torsion for the curve 


A(t) = (e' cost, e* sint, e*). 
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The curve a(t) = (cosht, sinht, t) is known as a hyperbolic heliz. Recall that 
the hyperbolic trigonometric functions are defined by the formulas, 
eat t_p-t sitet 


e e Se 
ht = ————,__ sinht = ———_,_ tanht = ———... 
cos 5 in 5 n nee 


We have the fundamental identity cosh? t—sinh? t = 1 as well as the derivative 
formulas 


© (cosh t) = sinht, © (sinh t) = cosht. 


EXERCISE 4.4. For the hyperbolic helix, show that 
eo ae oq (tanh t, 1, secht) where secht = ee 
(2). Bi wa(- tanht, 1, —secht), 
(3) N = (secht, 0, —tanht), 
(4) & =1/2cosh’ t, 
(5) r =1/2cosh? t. 


EXERCISE 4.5. Compute s(t) = i \a’|dt for the hyperbolic helix a. 


EXERCISE 4.6. Let a(t) be a not necessarily unit speed curve with involute Z(t). 
Show that the curvature of the involute Z(t) is given by 


V Ka(t)? t)? 
s(t) Ka(t) 
where s(t) is a’s arclength function. What does the formula become if a is a plane 
curve? Compute the curvature of the involute of the unit circle directly and from 
the formula above. 


Example 4.2: Plane Evolutes. 
The plane evolute of a plane curve a(t) is the plane curve whose involute is 
a. A formula for the evolute is given by 


1 
E(t) = a(t) + AO N(t) 


where N is the normal of a. Note that the evolute is the curve consisting of 
the centers of the osculating circles associated to a. To see that the involute 
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of € is a, first compute 
she 1 
E’=a'+ (=) N+-—N' 
kK K 


iy 1 
=] N+ (au T + Ty B) 


K 


II 
Q. 
a 
~~ 


1 / 
=] N-vT since 7 = 0 
K 


II 
Q. 
at 
~ > ee 


= (<) N since a’ =v T. 


The length of €’ is then |€’| = | (4)’ |. Note then that the unit tangent vector 
Ts of the evolute is the normal N of a. The involute of the evolute is then 
given by 


Here we have used the fact that the arclength function for the evolute is 
s(t) = "GE This follows since s(t) = ie \E'|du = i | (4)’ | du = +. We note 
here that evolutes may be defined for 3-space curves also. The formula for a 
space evolute is € = a+ + N + <B where c is a constant. 


Example 4.3: The Plane Evolute of a Parabola. 
Suppose a parabola y? = 2az is given with parametrization a(t) = (¢?/2a, t). 
Then a’(t) = (t/a, 1), v = |a’| = Va? + t?/a and, therefore, T = (t/Va? + #?, 


a/Va* +t). From this we may calculate curvature, 


ge: a? —at 
~ (a2 + ¢2)3/2’ (a2 4 t?)3/2 


=kKUN 
oe Ver N 
7 a 
and taking lengths gives 
a 
T'| ==> 
) a? + ¢? 


Va? +t? 


a 
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Hence, « = a”/(a? + t?)3/2 and, putting this in the expression for T’, N = 
(a/Va? + t?, —t/Va? + t?). Finally, the evolute is then given by 


1 
E(t) = a(t) + aD N(t) 


-(¢ ON (as a 
Ng a2 Var +t?’ Ja?+#? 
_ (207 +30? #3 

=( 2a 3): 


EXERCISE 4.7. Show that the x and y coordinates of €(t) satisfy the relation 
27ay* = 8(x —a)?, 


This curve is known as Neil’s parabola and we will meet it again when we discuss 
minimal surfaces. 


0.4 


0.2 


FIGURE 1.19. Neil’s parabola 


EXERCISE 4.8. Show that the evolute of the ellipse a(t) = (a cost, b sint) is the 


astroid 4 ‘ r é 
(<= cos? t, Dee sin® t,) Z 
a b 


EXERCISE 4.9. Show that an evolute of the catenary a(t) = (t, cosht) is given by 
(t — cosht sinht, 2 cosh f). 
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EXERCISE 4.10. Show that an evolute of the astroid a(t) = (a cos* t,a sin? t) is an 
expanded, rotated out-of-phase astroid. Hints: After computing the evolute, take 
an astroid (2a cos® t, 2a sin? t), rotate it by an amount guessed from the graph and 
let t=t— 7/4. 


1.5 SOME IMPLICATIONS OF CURVATURE AND TORSION 


We have already seen some of the ways in which the Frenet Formulas detect 
geometric information about the curve. As a more complicated example we 
will now consider a class of curves called cylindrical helices. In order to 
understand the definition of these curves, let us consider the standard helix, 
a(s) = (acos£, asin£, %) where c = (a? + b?)'/2. Consider T = a'(s) = 
(—2sin £, $cos &, B). Notice that the third coordinate is constant, so T-e3 = 
b (where e3 = (0,0, 1) as usual). Hence, cos@ = 8 so the angle 8 between T 
and e3 never changes. 

We may generalize this property by saying that a curve a, which we assume 
has unit speed, is a cylindrical helix if there is some constant unit vector u 
such that T’-u = cos@ is constant along the curve. The direction of the vector 
u is the azis of the helix. We can identify cylindrical helices by a simple 


condition on torsion and curvature. 


Theorem 5.1. Suppose a has k > 0 Then a is a cylindrical helix if and only 
if r/« is constant. 


Proof. Without loss of generality, assume a has unit speed. Suppose 7’: u = 
cos @ is constant for some u. Then 0= (Tu)! =T’-u=«N-u,soN-u=0 
since k > 0. Now, u is perpendicular to N, so 


u=(u-T)T+(u-B)B 
= cos@T + sin6 B. 


The diagram in Figure 1.20 explains the coefficient of B. 


FIGURE 1.20 


36 1. The Geometry of Curves 


Because u is constant, u’ = 0. Hence, 
0 =cos@T” + sin@ B’ 
= «cos6N —rsinON 
= (Kcos@ — rsin 8) N. 
Thus, «cos 6 — 7 sin = 0 which gives cot 6 = 7/« constant since @ is. 


Now suppose 7/k is constant. The cotangent function takes on all values, 
so we can choose 6 with cot @ = r/x. Define u = cos@T' + sin@ B to get 


uw = (Kcos6 —7sin6) N = 0. 
Hence, u is a constant vector and clearly T - u = cos@ is constant. Thus a is 
a cylindrical helix. C) 


In order to get a good geometric picture of a cylindrical helix we might want 
to project the helix back onto the plane determined by a starting point (G(0)) 
and the fixed (normal) vector u. The picture is given in Figure 1.21. 


A(s) 


G(0) 


FIGURE 1.21 


To do this, we subtract the u-component of 3(s) — 6(0) from ({(s). Now, this 
u-component has length 
h(s) = (G(s) — B(0)) -u 
and to see what this is in terms of the definition of @ as a cylindrical helix, 
we differentiate, 
— =f -u=T-u=cos8, 
a constant. Hence, h(s) = scos8+C. Now h(0) =0=0+4+C, so C =0 also. 
Finally, h(s) = scos@. Therefore the projection of the cylindrical helix G(s) 
onto the plane through G(0) and perpendicular to u is given by: 
7(s) = A(s) — h(s) u, 
where h(s) = scos@. Note that this parametrization of y is not necessarily a 
unit speed parametrization. 
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EXERCISE 5.1. Show that x = xg/ sin? 6, where kg is the curvature of (. 


EXERCISE 5.2. A cylindrical helix @ is said to be a circular helix if the projection 
7 is a circle. Show that @ is a circular helix if and only if 7 and « are constant. 


EXERCISE 5.3. What can you now say about the curve considered in Exercise 3.8, 


_fa+s? a-s'? s 
aa)= (Gree aa . 


EXERCISE 5.4. What do you know about the curve ((t) = (t + V3sin(t), 2 cos(t), 
/3t — sin(t))? 


EXERCISE 5.5. Show that @(t) = (at, bt?, t3) is a cylindrical helix if and only if 
4b* = 9a”. In this case, find the vector u such that T = G’/|G’| has T-u = constant. 


EXERCISE 5.6. Let G(s) be a curve and suppose that, for every s, N(s) is perpen- 
dicular to a fixed vector u. Show that @ is a cylindrical helix. Conversely, show 
that, for any helix, the normal N is perpendicular to the axis u of the helix. 


EXERCISE 5.7. Show that, for any helix, the Darboux vector w is parallel to the 
axis u of the helix. 


EXERCISE 5.8. Define a curve §(s) by G(s) = i Boa(t) dt, where a(t) is a unit 
speed curve. Show that kg = Ta, Tg = Ka, Tg = Ba, Ng = —No and Bg = Ta. 
Now suppose a is a circular helix and show that £ is as well. 


EXERCISE 5.9. Show that the space evolute €(s) of a plane curve G(s) is a cylindri- 
cal helix. Hint: Take €(s) to be unit speed and note, by definition, that the involute 
of € is 3. Compute torsion of 3 and set it to zero since @ is a plane curve. This 
gives conditions on derivatives of €. Use the Frenet formulas applied to €. 


We have seen that curvature and torsion, individually and in combination, 
tell us a great deal about the geometry of a curve. In fact, in a very real 
sense, they tell us everything. Precisely, if two unit-speed curves have the 
same curvature and torsion functions, then there is a rigid motion of R® taking 
one curve onto the other. Furthermore, given specified curvature and torsion 
functions, there is a curve which realizes them as its own curvature and torsion. 
These results are, essentially, theorems about existence and uniqueness of 
solutions of systems of differential equations. The Frenet Formulas provide 
the system and the unique solution provides the curve. We shall not prove 
these statements, but instead indicate portions of the proofs by the following 
exercises. 


38 1. The Geometry of Curves 


EXERCISE 5.10. Suppose we want a plane curve 3(s) with curvature a specified 
curvature function «(s). Show that such a @ may be defined by 


B(s) = ([ cos 6(u) du, : sin 0(u) au) 


where 6(u) = fo’ «(¢) dt. Hints: use the Fundamental Theorem of Calculus to show 
G has unit speed and then compute Tp. 


EXERCISE 5.11. Find a unit speed plane curve a(s) with ka(s) = 2s and a(0) = 
(0,0). The resulting curve is called Euler’s spiral or the spiral of Cornu. We will 
see in Chapter 8 that this curve arises as the solution to a variational problem. 


EXERCISE 5.12. Find a unit speed plane curve a(s) with Ka(s) = 1/(1 + 8”) and 
a(0) = (0,0). 


EXERCISE 5.13. Find a unit speed plane curve a(s) with ka(s) = 1/s and a(1) = 
(3, —2). 


EXERCISE 5.14. Show that, if two unit speed curves a(s) and G(s) have ka(s) = 
kg(s) and ta(s) = Tg(s) for all s, then there is a rigid motion (ie. combination of 
translations, rotations and reflections) of 3-space which takes a onto @. Hints: 


(1) Move G(0) to a(0) by a translation. 

(2) Rotate and, if necessary, reflect the Frenet frame of 8 at (0) = a(0) into 
the Frenet frame of a. 

(3) To show that the Frenet frames now coincide for all s, consider the function 


D(s) = Ta(s) -Tg(s) + Na(s) + No(s) + Ba(s) - Ba(s) 


which attempts to measure the deviation of the frames from being identical. 
Show that D(0) = 3, D(s) < 3 for all s and D(so) = 3 only when the frames 
coincide at so. 

(4) Show that D is constant by calculating D’ = 0. Use the Frenet equations. 
Hence, D(s) = 3 for all s. 

(5) Then, a/(s) = 6’(s). Integrate and use initial conditions to show a(s) = 
3(s), hence showing that the original translation, rotation and reflection 
carried ( into a. 


1.6 THE GEOMETRY OF CURVES AND MAPLE 


Computer algebra systems provide powerful tools for computation and vi- 
sualization in differential geometry. Applications range from simple compu- 
tations such as numerical evaluation of an arclength integral to more compli- 
cated graphics such as drawing geodesics on surfaces. No matter the appli- 
cation, the interaction of human geometer with computer is a very personal 
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thing. Therefore, in this section and all similar ones dealing with computers, 
I shall give (in the first person) my personal version of computers and geom- 
etry. I make no claims that the ways I have of doing things are optimal. In 
fact, I’m certain that many readers will have much better ways of approaching 
various problems. Nevertheless, I believe it is worthwhile nowadays to expose 
any prospective geometer to the world of computer algebra systems. In this 
text, I have chosen to use MAPLEV3 for WINDOWS (hereafter known as 
MAPLE) because this is the system I was exposed to first (in the post DOS 
age) and this is the system that I believe is the simplest for students to learn. 
MAPLE behaves similarly in its MAC (or Power PC) version although there 
/ are certain annoying features such as “Scratchpad” which must be dealt with. 
There are now many introductions to MAPLE on the market. For a straight- 
forward list of commands and package descriptions, however, take a look at 
[Red]. For a MATHEMATICA approach to geometry, the reader is referred 
to [Gr]. Let’s begin. 

Much of what we do in geometry deals with vectors, of course, and the 
easiest way to enter them in MAPLE is to write the following input. 


> A := [a,b,c]; 
This gives the following output. 


A: = [a,b,c] 
Components of the vector A may be isolated as follows: 


>A[1]; 
a 


Notice the semicolon at the end of an input. Without the semicolon, MAPLE 
does not know that the input has ended. Also, as we shall see below, there 
are times when a colon is preferred over the semicolon. A colon still causes 
MAPLE to execute the input, but suppresses the output. To try a simple 
example, input both of the following. 


>with(plots); 
>with(plots): 


The command with(plots) loads a special plotting package into a MAPLE 
worksheet (which is just the name of the WINDOWS screen where you do 
and save your work). This plotting package will be necessary for us later. 

Of course, we want computer algebra systems to carry out tasks which are 
tiresome to do by hand — such as the calculation of curvature « and torsion T. 
In order to see how this may be accomplished, let’s first look at some simpler 
calculations. The first set of procedures below are for dot product, the length 
(or norm) of a vector and for cross product. MAPLE has built-in procedures 
for these operations (e.g. dotprod), but in the old DOS days, there were some 
problems with using them in the geometric procedures below. Also, the very 
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simplicity of the procedures themselves is a painless way to learn the art of 
MAPLE procedure writing. To learn more about the built-in dot product, for 
example, use MAPLE’s most important command — ?dotprod. The ? is a 
signal for MAPLE to give you help, in this case on the dot product. Try ?plot. 
Now, type in the following. (To go from one line to the next in WINDOWS, 
use Shift-Enter or Shift-Return, where the dash means to hold down Shift 
while you press Enter. On the Power PC version, the Return key alone takes 
you to a new line while the Enter key on the far right enters input.) 


> dp := proc(X,Y) 
2 X{1}*Y[1]+-X[2]*Y([2]+X[3]+*Y [3]; 
> end: 


Hit the ENTER key on the keyboard (or double-click with the mouse). The 
dot product procedure is now entered into MAPLE for this particular work- 
sheet. Every time you begin MAPLE and open this worksheet, however, this 
procedure must be re-entered —- this goes for everything! The dot prod- 
uct procedure above does exactly what we would do as human calculators. 
Namely, it takes as input two vectors, X and Y, identifies their components 
by the bracket notation X[1] etc. and carries out the dot product multiplica- 
tion and addition. Notice that there is no colon or semicolon after proc(X ,Y) 
and there is a colon after the “end” to end the input (i.e. the procedure). For 
example, the following input produces a dot product output. 


> B := [k,],m]: 
> dp(A,B); 
ak+b6l+cm 


We could give this calculation a name as follows. 


> try := dp(A,B); 
try: =ak+6l+cm. 


The next procedures compute the length of a vector and the cross product 
of two vectors. By the way, note that MAPLE uses the usual computerese 
versions of multiplication (*), exponentiation (~) and square root (sqrt). 


> nrm := proc(X) 
> —— sqrt(dp(X,X)); 
> end: 


> xp := proc(X,Y) 

local a,b,c; 

a := X[2]*Y[3]-X[3]*Y[2]; 
b := X[3]*Y[1]-X[1]*Y[3}; 
c := X[1]*Y[2]-X[2]*Y[1]; 
ee 


VVVVVV 
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EXERCISE 6.1. Compute nrm(A,B) and xp(A,B) for the vectors A and B above. 


Once you have written one procedure, you can write a thousand. The following 
procedures compute curvature and torsion from the nonunit speed formulas of 
section 1.4. These procedures assume that the curve is given parametrically 
in the variable ¢ in three coordinates. If the curve is a plane curve in the ry- 
plane, say, then make the third coordinate zero. Note that, in the line right 
after proc( ), the word “local” appears. This tells MAPLE that the variables 
following “local” are to be considered only within the procedure (usually just 
for intermediate calculational purposes), so that, later on in the worksheet, 
invoking one of the local variables produces no response. 


> curv:=proc(alpha) 

local alphap,alphapp,num; 

alphap:=diff(alpha,t); 

alphapp:=diff(alphap,t); 

num:= nrm(xp(alphap,alphapp)); 

RETURN (kappa=simplify(num/nrm(alphap)~3)); 
end: 


tor:= proc(alpha) 
local alphap,alphapp,alphappp; 
alphap:=diff(alpha,t); 
alphapp:=diff(alphap,t); 
alphappp:=diff(alphapp,t); 
num:=dp(xp(alphap,alphapp),alphappp); 
RETURN(tau=simplify(num/nrm(xp(alphap,alphapp))~ 2)); 
end: 


VNNV VV WM VY OM VOW Voy 


In both procedures above, MAPLE differentiates each coordinate function 
in a parametrized curve “alpha” by the simple command diff(alpha,t);. The 
tangent and acceleration vectors “alphap” and “alphapp” are defined as local 
variables because they are not the objects of the calculation. They are only in- 
termediate quantities necessary to calculate curvature. In fact, in MAPLEV3, 
all variables are implicitly assumed to be local, but it is still a good idea to 
declare them as such — not only to avoid annoying “warnings” telling you the 
variables are “local,” but to make your procedure clear to anyone who reads 
or uses it. If it should ever be necessary to have a variable in a procedure pass 
out of the procedure to the rest of the worksheet, then you should declare 
the variable to be “global” (see ?global). Also, the command (all in capital 
letters) “RETURN” tells MAPLE to explicitly output whatever is contained 
within the parentheses. While this looks nice as output (i.e. K =...), if the 
output of the procedure is required in a later procedure, the output will be 
read as an equation rather than a quantity. To get around this, we can pick 
off the quantity itself by taking its righthandside (see below). 
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Remark 6.1. I like to keep “separator lines” between individual inputs or 
between a procedure, which has no separating lines within itself, and another 
input. This is a particularly good idea on the Power PC since it seems to try 
to execute every unseparated thing in sight. I will not include separator lines 
here, though, since it makes for difficult reading. Also, Power PC users should 
not have prompts > before every line in a procedure. PPC MAPLE will think 
each line is a separate input instead of a procedure. Secondary prompts >> 
are fine. 


Now, finally, here’s an example. Let’s take a helix. 
> hel:=[a*cos(t),a*sin(t),b-«t]; 

hel: = [acos(t),asin(t), bt] 
> curv(hel); 


_ a 
+e 

> tor(hel); ; 
7 e+e 


To take the righthand sides of these equations, we input 
> rhs(curv(hel)); rhs(tor(hel)); 
oe oes 
a* + b? a? + b? 
In order to evaluate the curvature and torsion of a particular helix, it is not 


necessary to redefine the whole curve. The command “subs” allows substitu- 
tions to be made into a general definition as in the following. 


EXERCISE 6.2. Input the following to compute curvature and torsion for the helix 
(4cos(t), 4sin(t), 2t). 

> hell:=subs({a=4,b=2}, hel); 

> curv(hell); 

> tor(hell); 


As I have mentioned above, before special types of plots can be made, the 
following command must be issued. 
> with(plots): 


With with(plots) we can now graph a red helix in three dimensions. The plot 
command is as follows. 


> spacecurve(hel1 ,t=0..10*Pi,scaling=constrained ,thickness=2,color=red); 


Here, hell is plotted as t varies from zero to 107. The entries following the 
domain of t are plotting options which make the plot appear a certain way. 
For example, “scaling=constrained” makes distance along the axes the same in 
each direction and “thickness” makes the curve thicker or thinner as required. 
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EXERCISE 6.3. Plot the helix with options “scaling=unconstrained,” “thickness=1,” 
and “color=blue.” For other options, type ?plot3d[options]. The command “space- 
curve” is a curve version of MAPLE’s “plot3d” command which we shall use exten- 
sively later. 


To plot plane curves, we can use the “plot” command. For example, let’s 
take the witch of Agnesi. There will always be some amount of trouble going 
from space curves to plane curves. For example, our dot product is calculated 
using three coordinates, so defining a plane curve with two coordinates will 
cause problems with the curvature procedure. On the other hand, plotting a 
plane curve with MAPLE’s “plot” command requires exactly two coordinates. 
It’s fairly simple to get around this trouble by first defining the witch with a 
fake third coordinate as 


> witch:=[2«*tan(t),2*cos(t)~ 2,0]; 
and then taking its curvature 
> curv(witch);. 


However, you cannot say “plot(witch, ...)” as we did for “spacecurve” for two 
reasons. First, since we’ve used three coordinates, MAPLE’s “plot” command 
will not work. Secondly, even if we use two coordinates to define the witch, 
MAPLE?’s “plot” command uses an annoying syntax which will not allow an 
input such as “plot(witch,t= -1.3..1.3);.” You must say (in this exact form) 


> plot([2*tan(t),2*cos(t)~2,t=-1.3..1.3],scaling=constrained); 


or use the first two coordinates of the witch as follows (with an optional “axes” 
parameter) 


> plot([witch[1],witch|2],t=-1.3..1.3],scaling=constrained,axes= framed, 
title=“The Witch,” titlefont=[TIMES,ITALIC,20]); 


Notice that, unlike “spacecurve,” the square brackets must go around the 
t =... as well as the parametrization. Also notice that we have included 
“title options” to give the plot a title with a particular font. For details, see 
MAPLE’S help page for title (i.e. type in ?title). Now try using options such 
as “constrained” or “color” or “axes” or “thickness” for the “plot” command 
and see what you get. 

We can use MAPLE procedures to graph more complicated curves. For 
example, the second procedure below takes input a curve and gives output 
the involute of the curve. The first procedures gives the arclength of a curve 
from 0 to t. 


> arc:=proc(alpha) 

local alphap ; 
alphap:=diff(subs(t=u,alpha),u); 
int(nrm(a]phap),u=0..t); 

end: 


VVVV 
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Notice that this procedure expects the curve “alpha” to be parametrized by 
“¢” and that, at the very first stage, “t” is replaced by a dummy variable “u.” 
Finally, the length of the tangent vector in terms of “u” (i.e. the speed) is 
integrated to produce an arclength in “t.” Using this arclength procedure, we 
can write a simple involute procedure. 


> involute:=proc(alpha) 

local aa,bb,cc; 

aa:=simplify(alpha{1]-arc(alpha)*diff(alpha{1],t) /nrm(diff(alpha,t))); 
bb:=simplify (alpha|2]-arc(alpha) *diff(alpha|[2],t) /nrm(diff(alpha,t))); 
cc:=simplify (alpha[3]-arc(alpha) *diff(alpha|3],t) /nrm(diff(alpha,t))); 

[aa,bb,cc]; 

end: 


VVVVVV 


For example, we have the following input and output. 


> circ:=[cos(t),sin(t),0]; 


circ := {cos(t), sin(t), 0] 


> inv:=involute(circ); 


inv := {[cos(t) + tsin(t), sin(t) — tcos(t), 0] 


If you wish to graph the circle and its involute together, you can do this by 
creating a plot structure for each and then displaying both at once. Also, 
while it is possible, as we have seen before, to use MAPLE’s “plot” command, 
we will use “spacecurve” here to avoid the bracket problem mentioned above. 


> circplot:=spacecurve(circ,t=0..2*Pi,color=black,thickness=1): 

> invplot:=spacecurve(inv,t=0..2*Pi,color=red,thickness=2): 

> display({circplot,invplot},scaling=constrained,axes=framed); 

EXERCISE 6.4. Show that the involute of the helix (cost,sint,t) is a plane curve. 


What plane curve is it? 


Similarly, we can write a procedure to find the evolute of a given curve. 
Recall that, given a(t), its evolute is defined by €(t) = a(t) + 1/K(t) N(t) 
where N is the normal of a. 


EXERCISE 6.5. Show that 


i la’ |? 


! uw / 
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Hint: use the nonunit speed Frenet formulas. 
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FIGURE 1.22. Involute of a circle 


The exercise leads to the procedure 


> evolute:=proc(alpha) 

local aa,bb,alphap,alphapp,A,C; 
alphap:=diff(alpha,t); 

alphapp:=diff(alphap,t); 

A:=nrm(alphap)~ 2/nrm(xp(alphap,alphapp))~ 2; 
C:=xp(xp(alphap,alphapp),alphap); 
aa:=simplify(alpha[1]+A*C[1)); 
bb:=simplify(alpha[2]+A*C[2]); 

RETURN (Evolute=[aa,bb]); 

end: 


VVVVVVVVV 


The following input and output gives the evolute of an ellipse. 


> ellipse:=[a*cos(t),b*sin(t),0]; 


ellipse := [acos(t), bsin(t), 0] 
> evolute(ellipse); 
cos(t)3(a? — 6?) sin(t)(—a? + a? cos(t)? + b* — b? cos(t)? 


Evolute = | —-—-___—- 
volute = ; 5 
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Notice that MAPLE simplifies the first coordinate, but not the second. In fact, 
although there does not seem to be a way of inducing MAPLE to simplify both 
coordinates simultaneously, the second coordinate simplifies by hand to 


sin(t)?(b? — a?) 
b b) 
so that the evolute of an ellipse is an astroid. By the way, to plot the astroid, 


it is necessary to first peel off the name “Evolute.” This may be accomplished 
by taking 


>ev:=rhs(evolute(ellipse); 


_ aus — 6?) sin(t)(—a? + a? cos(t)? + b* — b? cos(t)? 
a 


and plotting ev as we have done before. 


EXERCISE 6.6. Show that the evolute of the catenary (t, cosh(t)) is given by 
(t ~ cosh(t) sinh(t), 2cosh(t)). Plot this evolute. 


EXERCISE 6.7. Do Exercise 1.4.10 by finding the evolute of the astroid (acos*(t), 
asin°(t)) and then following the instructions below. 


(1) Take the astroid (2acos*(t),2asin°(t)) (thought of as a column vector) and 
multiply it on the left by the rotation matrix 


oe ea 


using MAPLE’s “matrix” and “matrix evaluation” commands. Before you can define 
a matrix, you must load MAPLE’s linear algebra package by typing 


>with(linalg): 
Now the rotation may be accomplished as follows. 


> astr3:=evalm(matrix(2,2,[sqrt(2) /2,-sqrt(2)/2,sqrt(2)/2, sqrt(2) /2])* 
[2xaxcos(t)~ 3,2*axsin(t)~ 3]); 


(2) Now shift the phase of the rotated astroid by 1/4 by taking 

> expand(subs(t=u-Pi/4,astr3[1])); 

> expand(subs(t=u-Pi/4,astr3[2])); 

Finally, compare your result to the expression for the evolute of the astroid above. 


(3) Plot both astroids on the same set of axes using the “display” command. You 
should get 
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FIGURE 1.23. Astroid and its evolute 


Now let’s use Exercise 1.5.10 to try to recreate a plane curve from its 
curvature alone. While the formula involves integrals which can only rarely 
be solved, we can still plot the resulting curve. The exercise shows that a 
plane curve £ is determined by its curvature as follows: 


Bayes ( | * Boabluvee i "gibt iu) 


where 0(u =f K t)dt. By the Fundamental Theorem of Calculus, we can 
nanves a ae into a system of differential equations to be solved 
numerically and plotted. The system is 


dO ue 


dg ds 
ds s) 


+ = = cos(8(s)) aa = sin(6(s)). 


a 
vV—— 


The result will be the unit speed curve with the specified curvature. As a 
first example, let’s take k = s itself. In MAPLE, we define the system and 
then use MAPLE’s differential equations solver to get a numeric solution. For 
details on its operation, type ?dsolve. 


> sys:=diff(theta(s),s)=s, diff(b1(s),s)=cos(theta(s)), diff(b2(s),s) 
= sin(theta(s)); 


> p:=dsolve({sys,theta(0)=0,b1(0)=0,b2(0)=0},{theta(s),b1(s),b2(s)}, 
type=numeric): 
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Notice that the numeric option in “dsolve” requires initial conditions as well 
as the system of equations. In fact, for certain equations, “dsolve” will return 
a solution in closed form from the system alone. The numeric solution may 
now be plotted using MAPLE’s “odeplot” command. 


> scurve:=odeplot(p,|[b1(s),b2(s)],-5..5,numpoints=200): 


> display(scurve,view=|-2..2,-2..2]); 


2 


ah 


FIGURE 1.24. Curve with «(s) = s 


EXERCISE 6.8. Create the unit speed curve with «(s) = 1/(1+ s). What is this 
curve? Can you prove it? 


Now let’s write a procedure to take an input curvature and output the 
curve. Additional inputs give bounds on the curve’s parameter and viewing 
region. 


> recreate:=proc(kap,a,b,c,d,f,g) 

> local sys,b1,b2,p,theta,pl; 

> sys:=diff(theta(s),s)=kap(s), diff(b1(s),s)=cos(theta(s)), diff(b2(s),s)= 
sin(theta(s)); 

> p:=dsolve({sys,theta(0)=0,b1(0)=0,b2(0)=0},{theta(s), b1(s),b2(s)}, 
type=numeric): 

> pl:=odeplot(p,[b1(s),b2(s)],a..b,numpoints=200,thickness=1, 
axes=framed,color=red): 

> display(pl,view=(c..d,f..g]); 

> end: 
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To specify a curvature function, give the parameter (t¢ say) and use the hyphen 
and greater than sign to construct an arrow -> to the effect on t. Some 
examples of curvature functions are 


> kapl:=t->t; 

> kap2:=t->t72:; 

> kap3:=t->exp(t); 

> kap4:=t->sin(t); 

> kap5:=t->sin(t) *t; 

> kap6:=t->sin(t)*t~ 2; 
> kap7:=t->1/(1+t72); . 


> recreate(kap5,-8,8,-2,2,0,3); (See Figure 1.24) 
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FIGURE 1.25. Curve with «(t) = t sin(t) 
EXERCISE 6.9. Recreate curves with the curvature functions listed above. 


We can also recreate curves in space from their curvature and torsion via 
the Frenet (differential) equations. The following procedure solves the nine 
Frenet equations together with the three equations defining the unit tangent 
vector of the curve and allows you to zoom in by defining a viewing region. 
The inputs kap and ta are the desired curvature and torsion and a,b bound 
the parameter of the solution curve while c,d,e,f,g,h give ranges for x,y and z 
respectively. 


> recreate3dview:=proc(kap,ta,a,b,c,d,e,f,g,h) 
> local sys,p,alph1,alph2,alph3,T1,T2,T3,N1,N2,N3,B1,B2,B3,pl; 
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a 
oO 


sys:=diff(alph1(s),s)=T1(s) diff(alph2(s),s)=T2(s), 
diff(alph3(s),s)=T3(s), 
diff(‘T1(s),s)=kap(s)*N1(s), diff(T2(s),s)=kap(s)*N2(s), 
diff(T3(s), s)=kap(s)*N3(s), 
(s),s)=-kap(s)+T1(s)-+ta(s)+B1(s), 
iff(N2(s),s)=-kap(s)*T2(s)+ta(s)+«B(s), 
diff(N3(s),s)=-kap(s)*T3(s)+ta(s)*B3(s), 
diff(B1(s),s)=-ta(s)*N1(s), diff(B2(s),s)=-ta(s)*N2(s), 
diff(B3(s),s)=-ta(s)*N3(s); print(sys); 
p:=dsolve({sys, alph1(0)=0,alph2(0)=0,alph3(0)= react 
T2(0)=0,T3(0)=0,N1(0)=0,N2(0)=1,N3(0)=0,B1(0)=0,B2(0)=0 
B3(0)=1}, {alphi (s) alph2(s),alph3(s),T1(s),T2(s),T3(s),N1(s), 
N2(s),N3(s),B1(s),B2(s),B3(s)},type=numeric): 
> pl:=odeplot(p,[alph1(s) alph2(s) alph3(s)],a..b,numpoints=200, 
thickness=1, axes=framed, color=red): 
> display(pl,scaling=constrained,view=(c..d,e..f,g..h]); 
> end: 


VVVVVVVVVVV 
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We can recreate a circular helix by taking « and 7 to be constants. For 
example, take « = 1/5 and r = 1/10 with the solution curve parameter 
ranging from 0 to 100, all viewed in a 20 x 20 x 20-cube, to get a standard 
helix. 


> recreate3dview(1/5,1/10,0,100,-10,10,-10,10,-10,10); 
EXERCISE 6.10. Use recreate3dview to recreate curves as follows. 
> recreate3dview(1/5,1/10,-20,100,0,18,0,8,0,30); 

> kap3d1:=t->t; 

> recreate3dview(kap3d1,1/10,0,10,0,2,0,2,0,1); 

> tau3d1:=t->t/10; 

> recreate3dview(kap3d1,tau3d1,0,10,0,2,0,2,0,1); 


Here is Viviani’s curve from Exercise 1.3.19. 

> viv:=[a*(1+cos(t)),axsin(t),a*2*sin(t/2)]; 

> vi:=spacecurve(subs(a=1,viv) ,t=-Pi..Pi,color=black,thickness=2): 

> sphere:=plot3d([2*cos(u)*cos(v),2*sin(u)*cos(v), 2*sin(v)],u=0..2*Pi, 
v=-Pi/2..Pi/2, style=wireframe, color=grey): 

> cyl:=plot3d({cos(u)+1,sin(u),v],u=0..2*Pi,v=-2..2,style=wireframe, 
color=red): 

> display({vi,sphere,cy]},scaling=constrained,orientation=([0,78)]); 

Now let’s use MAPLE to verify that Viviani’s curve is a spherical curve. We 

use the criterion for spherical curves given in Exercises 1.3.17-1.3.19. 


> kk:=simplify(rhs(curv(viv))); 
> tt:=simplify(rhs(tor(viv))); 
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FIGURE 1.26. Viviani’s curve 


> zz:=simplify((1/kk)~2+(1/tt)~2*(diff(1/kk,t)/ nrm(diff(viv,t)))~2); 


The curvature and torsion above therefore satisfy the equation 


2 
1)" a Se = = 4a? 
kK T\K/ lo(t)|) 

so that Viviani’s curve is truly spherical. 


EXERCISE 6.11. Show that the following curve is spherical. 


(a cos” (t), asin(t) cos(t), asin(t)) 


In Exercise 3.12, curves of constant precession were discussed. The recre- 
ate procedures above may be used to create curves of constant procession by 
choosing «(s) = asin(bs) and 7(s) = acos(bs) for constants a > 0 and 0b. 
However, in [Sco], an explicit parametrization for such curves is derived. Fur- 
thermore, it is shown that any such curve lies on a hyperboloid of one sheet 
(which we shall consider several times in this book) with the formula 


b 4b? 


as arr aak 
The explicit parametrization is given by 
c+b_, c-b, 
z(s) = Bele ~ by sin([c — b]s) — Bele + 5) sin([c + }]s) 
= c+b c—b 
y(s) = 2c(c — b) cos([c — b]s) + Dele + b) cos([c + b]s) 
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where c = Va* + b?. The following procedure graphs curves of constant pre- 
cession on the associated hyperboloids. 


> constprec:=proc(a,b) 


> local c,alpha,al,a2,hyp,pll,pl2 ; 

> c:=sqrt(a~2+b~ 2); 

> al:=c+b; 

> a2:=c-b; 

> alpha:=(al /(2*cxa2)*sin(a2+s)-a2/(2*cxal) «sin(alxs), 
-al /(2*c*a2)*cos(a2*s)+ a2/(2*ckal) «cos(alss), 
a/(bxc)xsin(bxs)]; 

> hyp:=([2*b/a~2«cosh(r)*cos(q),2*b/a~2 *cosh(r)«*sin(q), 
2/axsinh(r)]; 

> pll:=spacecurve(alpha,s=0..2*Pi,color=black,thickness=2, 
numpoints=200): 

> pl2:=plot3d(hyp,r=-1..1,q=0..2*Pi): 

> display({pl1,pl2},scaling=constrained,style=wireframe, 
shading=XY ,axes=framed); 

> end: 


EXERCISE 6.12. Graph the curves of constant precession with a = 15, b = 8 and 
with a = V6, b = V2. Can you make a conjecture about when curves of constant 
precession are closed curves? See [Sco]. 
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FIGURE 1.27. Constant precession curve with a = 4 and b = 3 


EXERCISE 6.13. Graph the unit tangent vector T(s) (ie. the tangent indicatrix) 
for curves of constant precession with a = 15, b = 8 and with a = V6, b = V2. Use 
“display” to graph T's) on the unit sphere. Can you make a conjecture about the 
number of cusps of such curves? For a discussion of tangent indicatrices of spherical 
curves, see [Sol]. 


Chapter 2 
SURFACES 


2.1 INTRODUCTION 


We see examples of surfaces in everyday life. Balloons, innertubes, cans and 
soap films provide physical models of surfaces. In order to study the geometry 
of these objects, what is required? Somehow, we must have coordinates to 
make calculations. But just because these surfaces reside in R* we shouldn’t 
let ourselves think of them as 3-dimensional. For instance, if we cut a cylinder 
lengthwise, then we can unroll it to lie flat on a tabletop. This indicates 
that surfaces are inherently 2-dimensional objects and, therefore, should be 
describable by two coordinates. This gives us our first clue as to an approach 
for describing the geometry of surfaces. Namely, we should try to spread 
part of the plane around a surface and, in terms of the required twisting 
and stretching, understand how the surface curves in space. The spreading 
provides coordinates to do calculus on a surface and various calculus-based 
quantities associated with the spreading describe the shape of the surface. 
Just as the calculus of the Frenet Formulas enabled us to describe the geometry 
of a curve, so also will the calculus of a surface enable us to describe the 
geometry of a surface. 

Let D denote an open set in the plane R?. The open set D will typically 
be an open disk or an open rectangle. Let 


x: D> R® (u,v) (2*(u,v), 22(u,v), £°(u, v)) 


denote a mapping of D into 3-space. The z*(u, v) are the component functions 
of the mapping x. We can do calculus one variable at a time on x by partial 
differentiation. Fix v = vo and let u vary. Then x(u, vo) depends on one pa- 
rameter and is, therefore, a curve. It is called a u-parameter curve. Similarly, 
if we fix u = uo, then the curve x(uo,v) is a v-parameter curve. Note that 
both curves pass through x(uo, vo) in R°. Tangent vectors for the u-parameter 
and v-parameter curves are given by differentiating the component functions 
of x with respect to u and v respectively. We write, 


w= (28 O82 Oe) (ak aa? an? 
“\ @u’ du’? du a y= \ ou’ Ov’ Ou) 
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FIGURE 2.1. Parameter curves with tangent vectors 


We can evaluate these partial derivatives at (uo,vo) to obtain the tangent, 
or velocity, vectors of the parameter curves at that point, x,(ug,vo) and 
Xy(uo, Vo). 

Of course, to obtain true coordinates on a surface, we need two properties. 
The first is that x must be one-to-one. In fact, as we will see later, we can 
relax this condition slightly to allow for certain self-intersections of a surface. 
Namely, we will allow those intersections which still give unambiguous normal 
vectors. Secondly, x must not “crinkle” parameter curves together so that x, 
and x, have the same direction. For, then, we lose 2-dimensionality — the 
very attribute of a surface we are trying to describe. The first property is 
verified by simple algebra or imposed by decreasing the size of D. But how 
can we show that the vectors x, and x, are never linearly dependent so that 
the second property holds? The following exercise provides a simple test. 


EXERCISE 1.1. Show that xy and xy are linearly dependent if and only if xy xxy = 
0. Recall that two vectors are linearly dependent if one is a scalar multiple of the 
other. 


A mapping x: D — R® from an open set D C R? is regular if xy x Xy + 0 for 
all points in D. (Of course, x, x x, must exist as well!) A coordinate patch 
(or parametrization) is a one-to-one regular mapping x: D > R?® of an open 
set D C R? to R?. A surface in R? is a subset M C R® such that each point 
of M has a neighborhood (in M) contained in the image of some coordinate 
patch x: D> MC R’®. 

Because each patch (e.g. x, y, Zz) on a surface is one-to-one (at least in 
a smaller open portion of the original domain), then the respective inverse 
functions exist (and, in fact, are continuous). We may, therefore, consider the 
composition of any pair of patches x~ty: D — R?. Such a map is differen- 


tiable (or smooth) if all partial derivatives Z, os, 2, --+ of each component 


function exist and are continuous. A surface is differentiable (or smooth) if 
x—'y is differentiable for any pair of patches on the surface. The surfaces we 
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FIGURE 2.2. A patch 


deal with will generally be smooth or close to it. That is, there may be a finite 
number of points which must be discarded to achieve smoothness. In virtually 
all of our examples, a surface M will be defined by a single patch with the 
exception of only a finite number of points. Indeed, we mention the notion 
of overlapping patches only because we use the idea later in our discussion of 
isometries and Gauss curvature. 

A function f: M — R from a surface M is differentiable (or smooth) if the 
composition f ox: D — R is smooth for each patch x on M. A curve on a 
surface is simply a mapping from an interval of real numbers J = [a,b] C R 
to the surface M, a: I  M. The surface M is said to be path connected if, 
for any two points p, q € M, there is a curve a: [0,1] — M with a(0) = p 
and a(1) = q. In this book all surfaces, unless explicitly mentioned otherwise, 
will be assumed to be path connected. A curve is differentiable (or smooth) 
if x-! oa: I — R? is smooth for all patches x on M. The following lemma 
shows one reason why smoothness of surfaces is important for us. Namely, 
in order to understand curves on surfaces, it will be sufficient to understand 
parametric curves back in the open domains of R? rather than in the surfaces 
themselves. 


Important Lemma 1.1. Let M be a surface. Ifa: I > x(D)C Misa 
curve in R?® which is contained in the image of a patch x on M, then for 
unique smooth functions u(t), v(t): J > R, 


a(t) = x(u(t), o(2)). 


Proof. Because a is smooth, the composition x~! oa: I — D is smooth by 
definition. Now, D C R?, so x~!a/(t) = (u(t), v(t)). Hence 


a(t) = x(x7!a(t)) = x(u(t), (2). 
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In order to see that u(t) and u(t) are unique, suppose a = x(u(t), 0(t)) for two 
other functions & and v. Note that we can assume that u and @ are defined 
on J as well by reparametrizing. Then 


O 


To study a curve on a smooth surface now, we can look at the functions 
of one variable u(t) and v(t). Also, note that, if a curve a on M is not 
contained in a single patch, then we may study it by considering the parts 
of it which lie in separate patches and then piece the information together. 
Again we note that our examples shall deal only with curves contained in 
a single patch. Finally, suppose M: x(u,v) and N: y(r,s) are two surfaces 
defined by single patches. A mapping F: M — N simply associates a point 
of N, unambiguously, to a point of M. We write F(p) = q for p € M 
and q € N. A mapping F is differentiable (or smooth) if the composition 
y~!oFox: Dy, > Dy is smooth as a map of open subsets of R?. That is, the 
usual partial derivatives must all exist and be continuous. Of course, we may 
make the same definition for surfaces covered by many patches. We simply 
require that all such compositions are smooth. Mappings of surfaces may be 
quite complicated as we shall see later. We shall also see, however, that linear 
algebra allows us to get a reasonable linear approximation of the mapping 
which is quite understandable. 


Examples of Patches on Surfaces 


Example 1.1: The Monge Patch. 
The graph of a real-valued function of two variables z = f(x,y) is a surface 
in R°. To see this, define a patch by 


x(u,v) = (u,v, f(u, v)) 


where u and v range over the domain of f. Then, x, = (1,0, of) and x, = 
(O.4. of). From now on, when convenient, we shall denote partial derivatives 
of functions by 


Of Of 
du a fu Ou _ fu. 
The patch is regular, as can be seen from 
ef. Ok 
XuXXy=|1 0 ful = (fu; =o; 1) # 0. 
01 fy 


As an example, consider the paraboloid z = x? + y’. 
A Monge patch is given by 


x(u,v) = (u,v, u? + v?). 
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FIGURE 2.3. A paraboloid 


EXERCISE 1.2. What are the parameter curves for the paraboloid when ug = 
0, v9 = 0? 


EXERCISE 1.3. Find a patch for the cone z = \/x? + y*. Does something go wrong 
here? 


EXERCISE 1.4. Find a Monge patch on (part of) the unit sphere x? + y? tee 
Why can’t a Monge patch be defined on all of the sphere? 


Example 1.2: Spherical Coordinates. 

Let M be a sphere of radius R (centered at (0,0,0) for convenience). Recall 
that spherical coordinates make use of two angles and a radius from the center 
(which in our case is fixed at R). 


FIGURE 2.4. Spherical coordinates 
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For a point a distance R away from the origin, draw a line segment from 
(0,0,0) to the point. The parameter v, which varies as —1/2 < v < 7/2, 
measures the angle (in radians) from this line down to the zy-plane. If the 
line is projected onto the zy-plane, then u (with 0 < wu < 27) measures 


the angle of the projection from the positive x-axis. Consider the following 
triangles. 


S 


R Rcos(v) 
Rsin(v) 


LN 


FIGURE 2.5 


Hence, the xyz-coordinates of the point are 
(Rcosucosv, Rsinucosv, Rsinv) = x(u, v) 


with x, = (—Rsinucosv, Rceosucosv, 0) and x, = (—Rcosusiny, 
—Rsinusinv, Rcosv). Further, we compute the cross product to be Xy, X Xy 
= (R? cosucos* v, R* sinucos*v, R? sinvcosv), where, in the 3° coordinate, 
we use sin? u + cos? u = 1. Note that |x, x x,| = R?cosv. From now on, we 
shall use the notation S*(R) for the sphere of radius R centered at the origin 
and simply S? for the sphere of radius 1 centered at the origin. 


Example 1.3: Surfaces of Revolution. 


Suppose C' is a curve in the ry-plane and is parametrized by a(u) = (g(u), 
h(u), 0). Revolve C' about the z-axis. 
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FIGURE 2.6. A surface of revolution 
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The coordinates of a typical point P may be found as follows. The z- 
coordinate is that of the curve itself since we rotate about the z-axis. If 
uv denotes the angle of rotation from the ry-plane, then the y-coordinate is 
shortened to ycos uv = h(u) cosv and the z-coordinate increases to h(u) sin v by 


h(u)sin(v) 


FIGURE 2.7 


Hence, a patch may be defined by: 
x(u,v) = (g(u), h(u)cosv, h(u) sin v). 


If we revolve about a different axis, then the coordinates permute. For exam- 
ple, a parametrized curve (h(u),0,g(u)) in the xz- plane rotated about the 
z-axis gives a surface of revolution (h(wu) cos v, h(u) sin v, g(u)). 


EXERCISE 1.5. Check that 


Why is xy X xy #0 for all u,v? 


In general, g(u) measures the distance along the axis of revolution and h(u) 
measures the distance from the axis of revolution. The nicest situation is, of 
course, when g(u) = u, for then it is easy to see where you are on the surface 
in terms of the parameter uw. 


EXERCISE 1.6. Find a patch for the catenoid obtained by revolving the catenary 
y = cosh(z) about the z-axis. 
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FIGURE 2.8. A catenoid 


EXERCISE 1.7: The Torus. 
Consider the diagram 
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FIGURE 2.9. A torus of revolution 


and revolve the circle of radius r about the z-axis. Show that a patch is given by 


x(u,v) = ((R+rcosu)cosv, (R+rcosu)sinv, rsinu). 


How do wu and v vary? 


Take a helix a(u) = 


Example 1.4: The Helicoid. 
(acosu, asinu, bu) and draw a line through (0, 0, bu) 
and (acosu, asinu, bu). The surface swept out by this rising, rotating line 


is a helicoid. 
The line required is given by (0,0, bu) + v(acos u, asin u,0), so a patch for 


the helicoid is given by 
x(u, v) = (av cos u, av sin u, bu). 
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FIGURE 2.10. A helicoid 


EXERCISE 1.8. Show that x(u,v) is a patch by verifying that |xu x xv| # 0. 


(5) EXERCISE 1.9: Enneper’s Surface. 
Define a surface by, 


ue 2 v° 2 2 2 
x(u,v) = (u— = + uv ‘ ae seu ,u—v’). 


Show that this definition gives a regular patch. Then show that, for wt? <3, 
Enneper’s surface has no self-intersections. Hint: use polar coordinates u = rcos@, 
v =rsin#@, show that the equality ety? t+ 4 z= gr7(3 + r?)? holds, where z, 
y and z are the coordinate functions of Enneper’s surface, and then show that the 
equality implies that points in the (u, v)-plane on different circles about (0,0) cannot 
be mapped to the same point. Finally, find two points on the circle ute? =3 
which do map to the same point of the surface. 


FIGURE 2.11. Enneper’s surface 
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Ruled Surfaces 


A surface is ruled if it has a parametrization 
x(u, v) = B(u) + vd(u) 


where @ and 6 are curves. That is, the entire surface is covered by this 
one patch which consists of lines emanating from a curve G(u) going in the 
direction 6(u). The curve G(u) is called the directrix of the surface and and 
a line having 6(u) as direction vector is called a ruling. A surface is doubly 
ruled if it has two different ruled patches x(u, v) = G(u) + v6(u) and y(u, v) = 
a(u) + vy(u). Ruled surfaces may have points with x, x x, = §’(u) x 6(u) + 
v6'(u) x 6(u) = 0. These points are well-controlled, as Exercise 1.15 below 
shows. We have two very well known examples of ruled surfaces. 


Example 1.5: Cones. x(u,v) = p+ v6(u) where p is a fixed point. 


Example 1.6: Cylinders. x(u,v) = @(u) + vq where q is a fixed direction 
vector. 


EXERCISE 1.10. Find patches for the standard cone z = \/x? + y? and standard 
cylinder x” + y* = 1 which are ruling patches in the sense of (1) and (2) above. This 
explains why the names cone and cylinder are used for the general patches of types 
(1) and (2) above. 


EXERCISE 1.11. Show that the saddle surface z = ry is doubly ruled. See Figure 
212, 


FIGURE 2.12. A saddle 


EXERCISE 1.12. Show that the helicoid is a ruled surface. 
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EXERCISE 1.13. The hyperboloid of one sheet 


has a useful parametrization 


x(u,v) = (acoshucosv, bcoshusinv, csinhw). 


FIGURE 2.13. A hyperboloid of one sheet 


However, this is not a ruling patch. Find two (related) ruling patches for this 
surface. That is, show the surface is doubly ruled. Hint: A directrix is G(u) = 
(acosu, bsinu, 0). Let 6(u) = B’(u) + (0,0,c). Because they are ruled surfaces, 
hyperboloids of one sheet are useful in engineering. For example, one reason nuclear 
cooling towers are built in the shape of hyperboloids of one sheet is that they may 
be built with straight beams along the rulings (see [Whi]). Therefore, stress on the 
tower is small (i.e. there is no internal bending) and the tower is easy to build. 
What do you think another reason is for using such shapes? Also, look at the weave 
in the base of a wicker chair with a sea-shell back. What do you notice and why? 


EXERCISE 1.14. Another patch for the hyperboloid of one sheet is given by 


l—wuv UtYU uU—v 
,b yc 
ltuv itu il1+u0 


x(u, v) = (a ). 

For simplicity, take a = b = c = 1 corresponding to a? + y” — z* = 1. Show that the 
u and v-parameter curves of this patch are, in fact, the rulings of the two patches 
in the previous exercise. Hence, the hyperboloid has an orthogonal patch whose 
parameter curves are straight lines. Hint: (1) in x(u,v), let w = ym (2) for fixed 
v, find the intersection of the ruling with the unit circle in the ry-plane (3) show 

2 
that x(u,v) = (2, rene 0) + w(—2v,1- v*,1+ v7) has direction vector, through 


64 2. Surfaces 


a point (a,b,0) with a? +b? = 1, a multiple of (— sinr, cosr, 1), where the first ruled 
patch of the exercise above is y(r, s) = (cosr,sinr,0) + s(—sinr, cosr, 1). 


EXERCISE 1.15. Suppose M: x(u,v) = G(u) + v6(u) is a ruled surface with |G’| = 1 
and |6| = 1. Also suppose 6’ # 0, so that M is noncylindrical. Show that M may 
be reparametrized by y(u,w) = y(u) + w6(u), where 7’ - 6’ = 0. Note that y may 
not be unit speed. A curve such as 7¥ is called a line of striction for M. Show that 
any point on M where xy xX Xy = 0 must lie on the line of striction. Hint: write 
y(u) = B(u) + r(u)d(u), use y’ - 6 = 0 and solve for r(u). Let w= 0 — r(u). 


EXERCISE 1.16. Find the lines of striction for the helicoid and the hyperboloid of 
one sheet. 


EXERCISE 1.17. Suppose M: x(u,v) = B(u) + v6(u) is a ruled surface. Show that 
M may be reparametrized by y(u, w) = e(u) + w6(u), where e’- 6 = 0. Hint: write 
e(u) = B(u) + s(u)d(u), use e’ -6 = 0 and solve for s(u). Your answer may be an 
integral. We will use this result in proving Catalan’s Theorem later. Find such a 
parametrization for a cylinder x(u, v) = B(u) + vq. 


The patches defined in this section provide a variety of examples of surfaces 
on which we will later test our notion of curvature (and its effect on geometry). 
We must note several important points, however, before we go on. 


(1) Occasionally our patches miss one or several points on a surface. This 
defect in a particular defining patch usually means that a single patch is not 
really sufficient to completely define the surface. In general, many patches 
may be required to cover a surface. Ask why, for example, in our patch on 
the sphere, the North and South poles are missing. 


(2) Other easily checked conditions also give us surfaces. For example, 
if we have a level set g(x,y, z) = c, then the Implicit (or Inverse) Function 
Theorem (IFT) may be invoked to show that the nonvanishing of the gradient 


of g, Vg = (%, af, ge , at all points of the level set, ensures that the level 
set is a surface. Essentially the IFT constructs a patch, so, in some sense, 
what we have done is more general. Furthermore, the “patch approach” allows 
much cleaner calculations later on. Thus, we have emphasized patches over 


other conditions for surfaces. 


FORESHADOWING EXERCISE 1.18. In the plane our standard coordinate system 
uses the vectors e; = (1,0) and eg = (0,1) as xy and x,y since the parameter curves 
are horizontal and vertical lines respectively. We have e; -e; = 1, e; -e2 = 0 and 
e2:e2 = 1. Use the notation FE = xy-xu, F = Xu- Xv, G = Xy + Xy and compute 
E, F, and G for all the patches given in this section. What is the same and what is 
different when a specific patch is compared to the plane itself? In particular, what 
does F tell you? 
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2.2 THE GEOMETRY OF SURFACES 


Now that we have an idea of what a surface is, how do we detect its 
geometry? One of the most important techniques in mathematics and, indeed, 
all of the natural sciences, is that of linear approximation. By this we mean 
the following. We recognize that the nonlinear or curved object at hand is too 
complicated to study directly, so we approximate it by something linear: a 
line, a plane, a Euclidean space. We then study the linear object and, from it, 
infer results about the original curved object. Of course this is exactly what 
we do when we attach the linear Frenet frame to a curve. This process also 
is useful in subjects ranging from differential equations to algebraic topology. 

The question is, of course, what type of linear space may be used to ap- 
proximate a surface? Just as, in one-variable calculus, we use a tangent line 
to approximate a curve near a point, so we can use a tangent plane T,(M) to 
approximate M near p € M. A plane consists of vectors and the natural idea 
for the make-up of T,,(M) is to use the vectors which arise as velocity vectors 
of curves on M. 

Formally, say that a vector v, € T,(M) is tangent to M at p if vp is the 
velocity vector of some curve on M. That is, there is some a: J — M with 
a(0) = p and a’(0) = vp. Usually we write v instead of vp when no confusion 
will arise. Then, the tangent plane of M at p is defined to be 


T,(M) = {v | v is tangent to M at p}. 


Immediately we know two curves which pass through p = x(ug, vo): the u 
and v-parameter curves with velocity vectors x, and xy. The following result 
says that every tangent vector is made up of a (unique) linear combination of 
X, and xy. Hence, {Xy,X,} form a basis for the vector space T,(M). (Recall 
that this is what the term ‘basis’ means.) 


Lemma 2.1. v € T,(M) if and only if v = A;xXy + A2Xy, where X,, Xy are 
evaluated at (ug, vo). 


Proof. First suppose that a is a curve with a(0) = p and a’(0) = v. We 
saw before that a(t) = x(u(t), v(t)), so the chain rule gives a’ = x,,(du/dt) + 
Xy(du/dt). Now, a(0) = p = x(u(0), v(0)), so u(0) = up and v(0) = vo (since 
x is one-to-one) and 

du dv 


v =a'(0) = Xu (uo, vo) = (0) + Xy(uo, vo) — 


Hence, A; = (du/dt)(0) and Az = (dv/dt)(0). 

Now suppose v = AiX,+A2X, (where x, and x, are evaluated at (uo, vo)). 
We must find a curve on M with a(0) = p and a’(0) = v. Define this curve 
using the patch x by 


a(t) = x(Ug + tri, vo + tA2). 
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Then a(0) = x(uo, vo) = p and 
d(uo + tr) 
dt 
d tr 
+ Xy (uo + tax, Vo + pro) Heo + a) 
= Xz, (Uo + tA1, Vo + tA2)A1 + Xv(Uo + tAr, Vo + tA2) Az. 


a’ (t) = Xy(uo + tA1, Vo + tr2) 


Thus, a/(0) = x,Ai + XpAz. O 


How do we identify the tangent plane via an equation? The same idea 
works for any plane in R°. A plane is determined by a point p and a normal 
vector (at p) to the plane N. Any q is in the plane exactly when q — p is 
perpendicular to N. We then have, N - (q — p) = 0 or, for g = (2,y,2), 
eS (Zo, Yo, 20) and N = (a, b, c), 


ax + by + cz — [azo + byo + czo] = 0 
ax +by+cez+d=0 


where d = —[axp + byo + cZ0]. This is, of course, the usual equation of a plane. 

In fact, we already know how to get a normal vector for T,(M). Since 
{X,,, Xv} forms a basis for T,(M), we need only find a vector perpendicular to 
both x, and x, (and hence to any v = A1Xy + A2X,). Such a vector is given 
by the cross product x, x X,. Therefore, if x is a patch for a surface with 
x(uo, vo) = p, then a normal N for T,(M) is given by, 


N = 2,(uo, Vo) X Xy(uo, Vo). 


FIGURE 2.14. Surface with normal 
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Example 2.1: A Function of Two Variables. M: z = f(z,y). We have 
a Monge patch x(u,v) = (u,v, f(u,v)) with x, x x, = (—fu, —fy,1). If 
p = (uo, vo, f (uo, vo)), then the equation describing T,(M) is, 


—fu(uo, vo)(% — uo) — fu(uo, vo)(y — vo) + (2 — f(uo, vo)) = 0. 
For instance, if z = x? + y”, then we have, for p = (x0, Yo, 2 + y2), 
—229(x — xo) — 2yo(y — yo) + (z — (#8 + ¥8)) =0 


which reduces to 
22or + 2yoyt+z= xe + Ye. 


Specifically, for p = (1,0, 1), we have 27+ z = 1 as the equation of the tangent 
plane. 


Example 2.2: The Sphere of radius R. zx? + y? + 2? = R?. 
Recall that the unit normal may be written, 


ix x, = (R? cos ucos? v, R? sin ucos? v, R? sin v cos v). 


(Note that x, x x, = (Rcosv) p, a multiple of the point on p the sphere. This 
verifies that the radial vector from the origin to the sphere meets the sphere 
at an angle of 7/2.) Suppose p = (R,0,0), so that u = 0 and v= 0. Then 
Xy, X Xy at p is (R?,0,0) and T;,(M) is given by, 


(R?,0,0)- (x — R,y,z) =0 
R?z = R° 
C=; 


This is, of course, the plane which is tangent to the sphere at (R, 0,0) 


In fact, we shall rarely be interested in the equation of the tangent plane 
— it is the normal itself which will detect the geometry of the surface. If 
we know how the direction of the normal to a surface changes as we move 
in the direction determined by a tangent vector, then we will know how the 
surface itself curves in that direction. There is a point which must be made 
here. When a surface requires more than one patch to cover it, there is the 
possibility that the normals defined from the various patches are incompatible 
with each other in a fundamental way. We refer the reader to [DoC1, p. 102] 
for details. Such a surface is said to be nonorientable and the first example is 


Example 2.3: The Mébius strip. Consider the regular parametrization 


x(u,v) = ((2 —v sin (=)) sin u, (2 —v sin ()) COS u, U COS (=)) 
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for 0 < u< 27 and —1 <v <1. This patch omits the interval where u = 0, 
so another patch is needed. This patch may be given by y(t, v) = 


te eee a _, {wT & ee al nt 
((2- a sin (= + 5)) cosa, (2-ssin(7+5))sina,o cos (7 + =)) 


which omits & = 0 corresponding to u = 7/2. No matter how we try to 
arrange it, these patches lead to the conclusion that the unit normal (of x 
say) must be equal to its negative! A picture of the Mdébius strip indicates 
how a normal may reverse itself as it travels around a closed curve. 
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FIGURE 2.15. A Mobius strip 


We have been rather vague about this notion of orientability since it will 
rarely come up in the rest of the book. In particular, a single regular patch x 
is always orientable since an explicit unit normal 


ik XS 


* |Xy X Xp| 


exists at each point. (Note that switching the order of x, and x, gives —U at 
every point of the patch.) This leads to the general definition that a surface is 
orientable if it has a smooth vector field of unit normals defined everywhere 
on it. Except for isolated examples such as Henneberg’s surface, which has a 
Mobius strip contained in it, the surfaces we deal with will be orientable. 
Now let’s return to our discussion of how the change in unit normal detects 
geometry. In order to see how we might calculate this change in a given 
direction, first let us see how a tangent vector acts to change a function. 
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Let g(z,y,z) be a function and let a(t) = (a'(t),a?(t),a3(t)) be a curve. 
Then the chain rule of multivariable calculus gives 


d _ Ogdx , Agdy | Ogdz 
BO aan Soy dE ae a 
_ Agdat , ag da , 09 da! 
~ Ox dt Oy dt = Oz dt 


Og Og Og da’ da? da? 
Ox’ oy’ Oz ‘dt’ dt’ dt 
= cciol def Og Og Og 

= Vg(a(t))-a'(t) where Vg = (32. By’ x). 


Now let v € T,(M) and suppose g is restricted to M C R°. We might ask how 
g changes on M in the v direction. But the v-direction on M itself means in 
the direction of a curve a with a’ = v at p. The directional derivative of g in 
the v-direction is 


vial(e) 2 S(9(o(t)) leao= Va(p) -v 


where a(0) = p € M and a’(0) = v. (Note that the last equality shows 
that the directional derivative does not depend on the curve chosen through 
p with velocity vector v.) Hence, v acts on a function g: M — R to produce 
a scalar v{g]. This is why the notation v[g] is chosen to denote directional 
derivative. Notice that the usual directional derivative of multivariable cal- 
culus has precisely the same definition, but the given function g may not be 
restricted to a surface M and the direction vector v may be any vector of R°. 
In many cases v may denote a vector field on M — asmoothly varying choice 
of tangent vector at some collection of points p € M — and then, v|g] gives 
a new function on M defined by vig](p) = vplg]. (Here we say that a vector 
field v smoothly varies on M if, for any curve a: I — M, the assignment 
t+ v(a(t)) is differentiable.) 

If f is a function on M, then the composition with a patch x may be 
formed, f(x(u,v)) = f ox, so that on the piece of M given by the patch, f is 
a function of u and v. Hence we may write 


Of _ A fox) 
dus Ou 
and similarly for v. Then, since (x(u,vo)) = Xu, we have 
6) 
xulf] = £(F((u,00)) lucuo= SE luc 


and this works for all f. That is, a u-parameter ee vector applied to 
a function gives the u-partial derivative of the function (composed with the 
patch). 
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EXERCISE 2.1. Show the Leibniz Rule (Product Rule) holds. That is, v[fg] = 
viflg + fvigl- 


EXERCISE 2.2. Show v{a] = v!, v[y] =v, v[z] = v> where v = (v!, v?, v°). 


EXERCISE 2.3. Suppose M is a surface which is the level set of a function g, 
g(x,y, z) = c. Show that Vg(p) is a normal vector for M at every p € M. Hint: 
take v € Ty(M) and show Vg(p)-v = 0 using the definition of v and the chain rule. 
Hence, for level sets we have another way of obtaining a normal. Try this for the 
sphere g(x, y,z) = 2? + y7 + 27 = R?. 


Important Note 2.4. In calculus, the directional derivative is generally 
defined using a unit vector v. The reason for this is simply that we wish to 
detect the change in a function in the v-direction and we do not want the 
magnitude of v artificially affecting our computation. For this very reason we 
will now take unit normals to surfaces. Let 


Ky x Xy 


xy X Xp] 


Now when we ask how U changes in a given direction, only U’s change in 
direction will be measured — and it is this quantity which will detect the 
shape of the surface. A normal vector U to a surface M takes any point of 
M, p, and assigns a vector U(p) € R® to it. Now, U(p) = (u!(p), u?(p), u3(p)) 
since it is a vector in R?, so we may write 


U = (u', u?, u3) 
or, in the basis e; = (1,0,0), e2 = (0,1, 0), e3 = (0,0,1) for R? 


U =u'le, + u*en + u%e3 
3 
= S- ue; 
i=l 


where u!, u?, u® are functions from M to R. Because U assigns vectors to 
each p € M, we say that U is a vector field on M. Similarly, any assignment 
of a tangent vector to each point of M which varies smoothly over M is also 
called a vector field on M. 

How should we describe the change in U in a direction v € T,(M)? We 
can simply look at the changes in the functions u!, u?, u? in the v-direction. 
That is, we consider the directional derivatives of the u* in the direction v. 
Of course, we are only interested in the initial rate of change of U in the 
v-direction because v is situated at p, so we evaluate derivatives at 0 (for 
a(0) = p). We give the name covariant derivative to this initial rate of 
change of U in the v-direction. Although the notation is somewhat strange, 
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remember that the covariant derivative is just the usual directional derivative 
applied to each coordinate function of U. This is denoted by, 


vues eb) 
= Y vive: 


From the definition of v[-], we see that 


V.U= ar (u'(a(t)) \lexo Gi 


where a(0) = p and a’(0) = v. Notice that, if U(t) is defined along a curve 
a(t), then the chain rule applied to each coordinate allows us to write U; = 
aU = = Va'(t)U.- From our discussion above, we know that VU tells us how M 
curves in the v-direction. That is, V(.)U tells us about shape. We, therefore, 
define, 
Sp(v) = -VVU 


to be the Shape Operator (or Weingarten Map) of M at p. The negative 
sign in the definition of S, is simply a convention which will allow us to give 
“positive” and “negative” curvature appropriate meanings. Also, note that 
choosing —U as normal reverses the sign of S as well. S is called an operator 
because of the following 


Lemma 2.2. S, is a linear transformation from T,(M) to itself. 


Proof. Recall that T: V — V is a linear transformation of the vector space 
V to itself if it has two properties: (1) T(vi + v2) = T(vi) + T (v2) and (2) 
T(c:-v) =c-T(v) for any constant c. To prove the lemma, the first thing 
we have to show is that S,(v) is another vector in T,(M). To show this, it 
suffices to prove that S,(v) is perpendicular to U (since being perpendicular 
to U at p defines T,(M)). Now U-U = (u’)? + (u?)? + (u3)? = 1 since U is a 
unit vector field and the usual Leibniz rule gives v[U -U] = 2V,U -U (show 
this!). Thus, for any v, 0 = v[1] = v[U-U] = 2V,U -U. Hence, VyU -U =0 
and V,U is perpendicular to U. 

To show S, linear, we must show that (1) S,(av) = aS,(v) for a € R and 
(2) S,(v + w) = S,(v) + S,(w). Property (1) follows immediately from the 
definition of VyU. Property (2) follows once we know that, for any function 
f, (v+w)[f] =v[f] + wl[f]. By the following exercise, we are done. ol 


EXERCISE 2.4. Let a(t) = x(u(t), v(t)), G(t) = x(a(t), o(t)) with a(0) = p = B(0) 
and a’(0) = v, 6’(0) = w. Show that '(0) = v + w for y(t) = x(u(t) + a(t), v(t) + 
v(t)). Then, using the gradient formula, show that (v + w)[f] = v[f] + w[/]. 
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Because S,: T,(M) — T,(M) is a linear transformation, we may bring to 
bear all the tools of linear algebra to analyze it. In particular, S, has a matrix 
form (which we discuss below) and standard invariants such as determinant, 
trace and eigenvalues have deep geometric significance. In this sense, the 
shape operator is the most fundamental of all the tools used to study the 
geometry of surfaces embedded in R?°. 


Example 2.5: M is a plane in R®. 

We know U = )u'e; is constant. Because the u’ are constant, v[u’] = 0 for 
all v. Hence, S,(v) = —VyU = — >> v[u'Je; = 0 for all v. This makes sense 
since a plane is flat — that is, the shape operator detects no “shape.” 


Example 2.6: $?(R) is the sphere of the radius R. 
We shall use the standard parametrization of the R-sphere given by x(u,v) = 
(Rcosucosv, Rsinucosv, Rsinv) with tangent basis vectors, 


x, = (—Rsinucosv, Rcos ucosv, 0), 


x, = (-Rcosusinv, —Rsin usin v, Rcos v). 


Also, the unit normal is given by U = (cosucosv, sinucosv, sinv). To 
understand the shape operator S on S?(R) it suffices to know what S' does to 
the basis {x,,,x,}. From our discussion above we know that x,[f] = oh =a 
and similarly for v. Therefore, 


S(x,y) = —Vx,U 


= —(x,[cos u cos v], X,,[Sin u cos v], x, [sin v]) 


—(— sin ucos v, cos ucos v, 0) 


Sey 
A similar calculation shows $(x,) = —7. Thus, the shape operator multiplies 


every tangent vector by -F- The corresponding matrix is -F I, where I is 
the 2 x 2 identity matrix. 


EXERCISE 2.5. Let M be the cylinder x? + y? = R? parametrized by x(u,v) = 
(Rcosu, Rsin u,v). Show that the shape operator on M is described on a basis by 
S(xu) = —#Xu and S(xy) = 0. Therefore, in the u-direction the cylinder resembles 
a sphere and in the v-direction a plane. Of course, intuitively, this is exactly right. 
Why? 


EXERCISE 2.6. Show that the shape operator S for the torus 


x(u,v) = ((R+ rcosu)cosv,(R + rcosu)sinv, rsinu), on a basis, is given by 
S(xu) = —** and S(xv) = wypcosu Xv- 
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EXERCISE 2.7. Show that the shape operator S for the saddle surface z = ry 
parametrized by x(u,v) = (u,v, uv) is given on a basis by 


_ 1 2 
S(xu) = ae et ie gs 
(1+ wu? +v2)2 (1+u? + v2)2 
1+u? uvU 
S(xv) = Xa 


— ee CX Sd 
(1tu2+v2)8 (14-02 422)8 


We have already seen that a plane has zero shape operator. Intuitively, 
since the shape operator detects the change in the unit normal U, a zero 
shape operator for a surface M should imply that M is a plane. This is 
verified by the following result. Notice that the algebraic condition on the 
shape operator must be translated into a geometric restriction on M — a 
restriction which characterizes a plane. Recall that a plane is described by a 
point p and a normal vector U which we think of as originating from p. Then 
the plane is given by the collection of all points q with (q — p)-U = 0. 


Theorem 2.3. If S, = 0 at every p € M, then M is contained in a plane. 


Proof. Fix p € M with unit normal U(p) at p and take an arbitrary point 
q € M. We will show that q is in the plane determined by p and U(p). That 
is, we will show (q — p)- U(p) = 0. Since q is arbitrary, then all of M will be 
in this plane. Take a curve a in M with a(0) = q and a(1) = p and define a 


function 
f(t) = (q—-a(t)) -U(a(t)). 


The product rule and the fact that Le = Va(t)U allow us to compute the 
derivative of this function. 


f'(t) = —al(t) -U(a(t)) + (q—- a(t) - Vary (a(t) 
= 0 


since a’(t) € TatyM is perpendicular to U(a(t)) and VaityU(a(t)) = 
—St)(a’(t)) = 0 by hypothesis. Thus, since the derivative of f is zero, 
the function f(t) is constant. To see what constant this is, we can evaluate f 
at 0. But clearly f(0) = 0 since a(0) = q, so (q—a(t))-U(a(t)) = 0 for all t. 
In particular, for t = 1 we get (¢ — p)- U(p) = 0 and we are done. O 


2.3 THE LINEAR ALGEBRA OF SURFACES 


Now, it should be clear that even for slightly complicated examples, an 
exact global representation for S on a surface M may be difficult to achieve. 
Even if we do obtain an exact representation of the shape operator (on a basis 
say), the form of the representation may not give much information (e.g. the 
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saddle above). We can still cull the essential geometry of M, however, from 
computable quantities associated to the shape operator. Therefore, the use- 
fulness of the shape operator itself is derived not from its computability, but 
rather from its theoretical description of geometry in terms of linear algebra. 
With this in mind, we shall prove one theoretical fact about the shape opera- 
tor. The computations in the proof will be of importance later. Before we do 
this, it is worthwhile to recall a few notions of linear algebra. 

Suppose T’: V — V isa linear transformation of the vector space V to itself. 
If we are given a basis B = {x,,...,x,} for V, then T' may be represented by 
a matrix A. This correspondence works in the following way. First, T(x;) = 
7-1 "x; since B is a basis. Second, for fixed i, the a’* may be assembled 
into the i" column of a matrix A. Doing this for each i gives A = (aJ*), an 
n X n matrix which takes e; = (0,...,0,1°,0,...,0) (thought of as a column 
vector and representing x;) to a= (a!’,a’,...,a”*). Here the vectors x; and 
T(x;) have been identified with their respective matrices of coefficients e; and 
a with respect to the basis B. Notice that, while the linear transformation T 
is defined without regard to a chosen basis, its representation as a matrix A 
depends essentially on which basis is chosen. Choosing a different basis for V 
changes the coefficients in the expansion of T(x;) and so a new matrix arises. 
For a linear transformation T, if there is some vector v and real number A 
such that 


T(v) =Av 
then A is called an eigenvalue of T associated to the eigenvector v. If V is 
n-dimensional and there are n linearly independent eigenvectors v1,--- , Vn 
with associated eigenvalues \1,--- ,An, then, in this basis for V, the matrix 
for Tis diagonal (and hence very simple) 
vi 0 
A2 


Therefore, det(T) = []j_, X44 and tr(T) = )77_, Ai. In general, the determi- 
nant of a matrix is the product of its eigenvalues and the trace is the sum of 
the eigenvalues. 

Example 3.1. Let T: R? — R? be a linear transformation with T(1,0) = 
(4,3) and T(0,1) = (—2,-1). The matrix for T with respect to the basis 


{(1, 0), (0, 1)} is 
bee, 


[3 a} [e}=[a8]=[o 3] [2] 


Suppose we have 
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AK 22 a} |0 
3 —1-A]|b}]~ |0]° 
But a matrix can only take nonzero vectors to zero if det(T’) = 0. Hence 


det |*5 aA |=0 


Subtracting, we get 


3 -1- xX 


so (4—A)(—1— A) + 6 = 0 or A? -3A +2 =0= (A-1)(A—2). Thus \ = 1 or 
A = 2. These are the eigenvalues of T. To find the eigenvectors, plug \ back 
into the matrix equation to get, 
3 -—2| la 0 3 
For \ = 1: E Z| sl= lol or 3a—2b=0_ or 5 t= 6. 


This means that any vector of the form [a, 32] = v will satisfy T(v) = v 


(since A = 1). Indeed, take [2,3] and note that 


4 —2/|)2|  |2 
3 -1}/3);  [3]° 
EXERCISE 3.1. Show that the other eigenvector is given by [a, a]. 


EXERCISE 3.2. Show that, if [2,3] and [1, y are chosen as a basis for R*, then the 
matrix for T’ relative to this basis is 


fo 3): 


Finally, note that A,A2 = 2, A; + A2 = 3 and, calculating from the original 
matrix, these are equal to 


4 -2 4 -2 


A linear transformation T: R? — R? is said to be symmetric if T(v)-w = 
v - T(w) for all vectors v and w in R?. 


EXERCISE 3.3. 

(1) Show that, with respect to any orthonormal basis, if the 2 x 2 matrix bE A 
represents a symmetric linear transformation, then b = c. Such a matrix is 
called a symmetric matriz. Recall that a basis is orthnormal if it consists of 
unit vectors which are perpendicular. 

(2) Show that any 2 x 2 symmetric matrix has real eigenvalues. 
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Theorem 3.1. The shape operator is a symmetric linear transformation. 
Further, S(x,) + Xu = Xuu:U, S(Xu)-Xy = Xuv° U and S(x,y) + Xy = Xyy + U. 


Proof. Since {x,, X,} is a basis for T,,(M) we need only show the equation on 
x, and x,. Note first that U- x, =0 =U-xz, since U is a normal to T,(M). 
Now U -x,, and U -x, are functions, so a vector may be applied to them. 


=X; (0) xq [UV = 3G; 


=x yom 
aa Ov 
a Da mu [u*] = sei wx) by the Leibniz Rule 
07x 
= a t 4° Xy U fence ees 
Dd X,[u' Je; -Xy + Aa 
= Vx,0 -Xy + U - Xyy- 
Hence, S,(Xy) +X» = —Vx,U +x, =U -Xyy. A similar computation shows, 


Sp(Xy) - Xu = —Vx,U > xXy =U - Xuy. 


But Xyy = Xyy so the result follows. Finally, the same calculation as above 
shows S(X,)-Xy = Xuu:U and S(x,)- xy = Xp, U. Q 


Corollary 3.2. The shape operator has real eigenvalues. 


There is yet another way linear algebra makes its presence felt in differential 
geometry. Given a map of surfaces F’': M — N, we may define its derivative 
to be a certain linear transformation on all tangent planes F.,: T,(M) —- 
Tr(p)(N). This linear transformation (which we shorten to) F, is defined by 
letting a tangent vector v be represented by a curve a: I — M with a(0) =p 
and a’(0) = v and saying 


! def d 
F.(a'(0)) = 4 (Fla(t)) le=0 - 


Now, A(t) = F(a(t)) is simply a curve on N and the righthand-side of the 
definition of F, is simply the velocity vector of @ at F'(p). The geometry of the 
definition may now be seen. In order to understand F’, linearly approximate M 
near p by the tangent plane T,(M). Each vector in T,(M) has a curve whose 
velocity vector is that vector, so use composition with F' to map the curves 
over to N. The curves and their images under F' fill out regions near p and 
F(p) respectively. To see the geometry of even this local mapping, however, 
we again must linearly approximate by taking derivatives of the image curves 
in N. Then, by definition, the velocity vector of the image curve is the image 
of the velocity vector of the original curve in M. 
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We have made a big deal of this notion of the derivative of a surface map- 
ping not only because it is a difficult idea, but also because it plays an im- 
portant role in what is to follow. In particular, as we shall see later, surface 
mappings and their derivatives will give us the right tools to compare sur- 
faces and their geometries. Of course, a linear transformation is determined 
by its effect on a basis, so, given a parametrization M: x(u,v), F. may be 
determined completely by calculating F.(x,) and F,(x,). 


Example 3.2. Let F': R? — R? be a map of the plane to itself. In coor- 
dinates, we may write F(u,v) = (f(u,v),g(u,v)). To calculate F,(x,) we 
take the composition of the u-parameter curve (u,vo) with F, F(u,vo) = 
(f(u, vo), g(u, vo)) and differentiate with respect to u to get (fu,gu). A simi- 
lar calculation for the v-parameter curve gives 


F, (Xu) = (Sus Gu) and F,(Xy) = (fu, gu). 


By what we have said about the correspondence between linear transforma- 
tions and matrices, we see that, with respect to the basis {x,,x,}, the matrix 


for F, is 
J(F) — i i : 
Ju Qu 


This is the Jacobian matrix of several variable calculus. 


The derivative mapping of a surface map is, therefore, just a natural exten- 
sion of the Jacobian from maps of Euclidean spaces to maps of surfaces. Now, 
the shape operator is a linear transformation of the tangent plane, so we might 
well ask if there is a surface mapping whose derivative is +S? The answer is, 
in fact, yes. The mapping whose derivative is +S is called the Gauss map and 
its behavior provides a geometric alternative to the linear-algebraic approach 
of the shape operator. The Gauss map is a mapping G: M — S? from the 
surface M to the unit sphere S? given by G(p) = U(p), where U(p) is the unit 
normal of M at p. Since U(p) is a unit vector in R°, we may represent it as 
a point on S*, so this definition makes sense. Now, the Gauss map has an 
induced derivative map G,: T,M — Tgp)S? and, by definition, 


G.lv) = Z(Glalt)) reo 


= 5 (U(alt))|e=0 
= Va'(0)U 

= V7 

= ~S(v), 
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where a(0) = p and a’(0) = v. One thing needs to be made clear. Namely, 
the shape operator has range T;,(M) while the Gauss map has range Tg,»)S?, 
so how can G, = —S? We must look at the geometry of the situation for 
the answer. The tangent plane, no matter where we visualize it meeting 
the surface, is a vector space and, therefore, passes through the origin. The 
plane T,(M) is the plane through the origin perpendicular to U(p). For any 
point g € S?, the tangent plane T,(S*) has the beautiful property that it is 
perpendicular to q itself, thought of as a vector in R°. Because G(p) = U(p) by 
definition, the plane T¢,,)(S”) is the plane through the origin perpendicular 
to U(p). That is, T,(M) = Tep)(S?). This explains how the computation 
above can make sense. We will see later that the Gauss map has many uses. 
For now, try 


EXERCISE 3.4. Compute the Gauss map and its derivative for the cone x(u,v) = 
(vcos u, vsin u,v) and estimate the amount of area the image of the Gauss map takes 
up on the sphere. 


EXERCISE 3.5. Compute the Gauss map and its derivative for the cylinder x(u, v) = 
(Rcosu, Rsinu,v) and estimate the amount of area the image of the Gauss map 
takes up on the sphere. 


EXERCISE 3.6. Compute the Gauss map and its derivative for the catenoid x(u, v) = 
(u, cosh u cos v, cosh usin v). Show that the Gauss map is a one-to-one map from the 
catenoid to the sphere. Hint: focus on the first coordinate first. 


EXERCISE 3.7. Compute the Gauss map for Enneper’s surface x(u,v) = (u— - + 
uv, v— - + vu, ue = v’) and show that it is a one-to-one map from Enneper’s 
surface to the sphere. Also show that the image of the disk {(u,v)|u? + v? < 3} 
under G covers more than a hemisphere of the sphere. Hint: first, write U in polar 
coordinates by letting u = rcos@ and v = rsin@ and, then focus on the third 
coordinate. 


2.4 NORMAL CURVATURE 


We would now like to use the shape operator to obtain a notion of curvature 
of a surface. After a very geometric approach to this normal curvature we 
will prove a theorem linking the geometry to the linear algebra of S. First we 
note, 


Lemma 4.1. If a is a curve in M, then a” -U = S(a’)-a’. 


Proof. We will give two proofs of this result. The first reinforces the compu- 
tations above and the second foreshadows the types of calculations we will do 
when we consider geodesics. 
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(1.) We know a’ -U = 0 since U is normal to T,(M). Then, 


= i a a! [a Jué + ee a’ a’[u'] by the Leibniz Rule 


since a’ (a(t)) = — tt 


/ 


ie 
a" U4 Vy al, 


Hence, S(a’)- a’ = —VyU -a’ =a" -U. 


(2.) By Lemma 1.1, write a(t) = x(u(t), v(¢)) and a’ = u/x, + v’x, by the 
chain rule. Applying the chain rule again gives 


2 2 
a” = Ul Xu, + Qu! U'Xuy FU Xyy tu" Ky +" Ky. 


Now, x,:U =0 and x,-U =0, so 


a” -U = (u' 2 xXuu + 2u'u'Xuy +0’ "xu U, 


Because S is a linear transformation, we have 


S(a’) +a! = S(u'xXy + v'Xy) + (u'xXu + 0'Xy) 
= (u'S(x,) + v'S(xXy)) - (u'xXy + U'Xy) 
= uw? 5(xy) -Xy + Ulu’ S (xy) -Xy + ulv'S(Xy) «xy + 0/7 S(x,y) Xp 
= ul xy Ubu u'Xyy Ut ulo'Xuy U $0 xXpy -U 
= (ul? Xu 4 2U'U Kay ok v' Xyy) -U 


=a’ .U. O 


We interpret a” -U as the component of acceleration due to the bending of 
M. Of course, we assume that a@ has unit speed so that the magnitude of a’ 
does not affect our measurement. With this in mind, we make the following 


definition. For a unit vector u € T,(M), the normal curvature of M in the 
u-direction is 


k(u) = S,(u)-u 
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Let a be a curve with a(0) = p, a’(0) = u. Then 


K(u) = Sp(u)-u 

= S,(a'(0)) - a’(0) 
a(0) - U(p) 
«(0)N(0) - U(p) 
= (0) cos 8 


i 


where WN is the Frenet normal to the curve a and « is a’s curvature. The 
angle @ is the angle between N(0) and U(p). 


FIGURE 2.16. Normal curvature 


EXERCISE 4.1. The total torsion of a curve a: [a,b] + R° is f 7 dt. Show that a 
closed curve on an #-sphere has zero total torsion. Hints: use the formula above, 
k(T) = Ka cos@, where T = a’ for a unit speed curve a, the shape operator of the 
sphere and differentiate cos? = N-U. According to Exercise 1.3.12, this means the 
linking number of a and a+ eWN is zero. Why is this true geometrically? 


While the normal curvature is a multiple of the curvature of a, we might ask 
if there is a curve o whose curvature is exactly equal to the normal curvature 
k(u). The answer turns out to be yes. 


Proposition 4.2. Let P denote the plane determined by U(p) and u (at 
p € M) and let o denote the unit speed curve formed by PNM with o(0) = p. 
Then k(u) = +k,(0). 


Proof. First we show that o’(0) = u. But this follows since u and o’(0) are 
tangent vectors and u, o’(0) and U(p) lie in a single plane. Namely, the 
only way for u and o’(0) to both be perpendicular to U(p) and in P is for 
o’(0) = tu. Take a parametrization with o’(0) = u. Now o’s normal N,(0) 
is perpendicular to T,(0) = o’(0) = u and N,(0) is in the plane P since o 
ia a plane curve. Thus, N,(0) = +U(p). Then cos@ = +1 since 6 = 0 or z. 
Hence, k(u) = +«,(0). 0 
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What does normal curvature tell us? If k(u) > 0, then the normal N,(0) is 
equal to U(p) (i.e. 6 = 0°) and o bends up toward U(p). Hence, so does M 
along o. If k(u) < 0, then the normal N,(0) is equal to —U(p), so, along a, 
M bends away from U(p). If k(u) = 0, then «,(0) = 0. This does not mean 
that o is a line, of course, since we don’t know that «,(t) = 0 for all ¢. But 
it does say that near p the rate of bending is small. 

Therefore, the sign of normal curvature tells us about the bending of 7 
toward or away from its normal in a given direction. Note that by changing 
the normal to —U(p) the signs on k(u) reverse. Therefore, to avoid this 
ambiguity, we must fix a convention for normals. For instance, we could say 
that we will always take outward pointing normals. 

Normal curvature is a function from unit vectors in a plane — that is, 
a circle of radius one — to real numbers. In fact, it can be shown that this 
normal curvature function is continuous. Just as any continuous function on a 
closed interval attains its maximum and minimum, so too does any continuous 
function on a compact (i.e. closed and bounded) set like the circle. Hence, 
there are unit vectors u; and ug such that 


k(u,) = ki = max k(u) k(ug) = ke = min k(u). 


The unit vectors u, and up are called principal vectors and k, and kz are 
principal curvatures. 


EXERCISE 4.2. A curve a: I — M is a line of curvature if a'(t) is an eigenvector 
of the shape operator S for every t € J. This is equivalent to saying that the unit 
tangent Ty is always either u; or ug. Show that a is a line of curvature if and only 
if a’ is parallel to U' = Vq/U along a. Further, suppose a is a plane curve. That is 
a= M0Q P for some plane P. Then show that a is a line of curvature if the angle 
between M and P is constant along a. 


Let’s do an example before we go on just to see how normal curvature may 
be computed. We will make use of the Frenet Formulas. 


Example 4.1. Let M be a saddle surface z = x? — y?. We use the normal 
U given by the gradient of g(z,y,z) = z-—2?+y* =0, 


Cs V9 = (-22, 2y, 1) 


~ |Vgl 4/1 + 422 + 4y2 


Let p = (0,0,0). Then U(p) = (0,0,1). Take u = (1,0,0) for example. The 
curve o determined by the intersection of the plane spanned by (0,0,1) and 
(1,0,0) with M is the parabola z = x? bending toward the normal. Therefore, 
we should find k(u) > 0. Let’s see how to compute explicitly. 
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The plane through u and U(p) is the zz-plane with normal vector u x 
U(p) = (0,—1,0). Hence, y = 0 and a is the parabola z = z* which may 
be parametrized by o(t) = (t,0,t?) say. Then o’(t) = (1,0,2t) and o”(t) = 


(0,0,2). Then k(1,0,0) = +,(0) = “ORC _ 2 = 2 > 0 just as 
we thought. Note. We could find the entire Frenet Frame: T = Gal = 


1 2t : Gt es ; =a 
(shee 0. oes | with T(0) = (1,0,0); B= 2225 = (0,-1,0); N = 
Bx T =(0,0,1) =U with a + sign. This justifies k(1,0,0) = +«,(0). 


EXERCISE 4.3. Find k(u) at p = (0,0,0) where u = (0,1,0). What sign should 
your answer have? 


EXERCISE 4.4. Find k(u) at p = (0,0,0) where u = (45, 5,0). 


Example 4.2. Let M: x? + y? = 1 be a cylinder with p = (1,0,0) and 
U(p) = (1,0,0). A unit vector u € T,(M) has the form u = (0,u',u?) with 
(u1)? + (u2)? = 1. A normal for the plane determined by u and U(p) is 
(0, —u?,u?), so the plane’s equation is z = (u?/u1)y. The intersection of the 


ee with M is the set {( 1—y?,y, (u?/u)y) } for any y. Parametrize o 


by a(t) = (V1 = #?,t, (u?/u!)t) with 
= (-t/Vv1 — £2, 1,u?/u') and o"(t) = (-1/a — 17)3/2 9, 0) ; 


Then we have, 


T(0) = (0,1, u?/u")//1 + (u2/u!)?2, 


B(0) = 0'(0) x o”(0)/|o"(0) x o”(0)| 
= (0, -u?/ul,1)/V1 + (u?/ul)? 
= (—(u?/ut)? — 1/1 + (u?/u")?, 0,0) 
ae 3 1,0, 0) 
since (u!)? + (u?)? = 1. Hence, N(0) = —U(p), so we need a — sign in 
k(u) = —K,(0). Burthar 
_ lo’(0) x o"(0)| _ ((u? Ju)? +1)? _ aye 
O= er OP ~ 0+ wefan ~ 
Hence, k(u) = —(u!)?. This is negative or zero. Now, since u = (0,u', u?) 


is on the unit circle in the yz-plane, max k(u) = 0 occurs when u! = 0 and 
min k(u) = —1 occurs when u! = 1. That is, 
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The cylinder M is flat in the ruling 
directions and bends away from the normal in directrix directions. Indeed, 


u 
the bending is what we might call circular. 


= —1; 


) 


u 
of velocity vector of directrix 


FIGURE 2.18. min; k( 
Although it was defined in terms of the shape operator originally, we have 


seen that normal curvature may be described completely geometrically as 
the curvature of the curve of intersection of the surface with a particular 


EXERCISE 4.5. Work through the example above. 


The corresponding geometry is clear. 
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plane. We will now present a fundamental theorem linking the geometry of 
normal curvature to the linear algebra of the shape operator. Say that a 
point p € M is an umbilic point if the principal curvatures at p are equal 
(i.e. ki(p) = ke(p)). Note that this implies that the normal curvature at p is 
constant. For instance, every point on a sphere is an umbilic point. 


Theorem 4.3. 

(1) Ifp € M is umbilic, then S,(u) = ku where k = ky = ko. 

(2) Ifp € M is not umbilic, then there are exactly two perpendicular unit 
eigenvectors of S, with associated eigenvalues the principal curvatures 
at p. 


Proof. We will prove the theorem by going in reverse. We shall consider the 
eigenvalues of the shape operator and show that these are, in fact, k; and kg. 
Let u; be a unit eigenvector for S with eigenvalue 41; S(u;) = A, uy. Let 
Uz be the unit tangent vector rotated 90° counterclockwise from u;. We may 
write S(u2) = au; +bug. But a = S(ug)- uy = S(uz) - ug = Ay Ws: Ug = 0, 
so S(uz) = Az U2, where b = Ap. Thus, uz is the other eigenvector of S and 
is perpendicular to u;. Also, k(u;) = S(u;)-u; = Ay u;- uj; = 4, So A; and 
Ag are normal curvatures at p as well. 

Note that, if A; = A2 = A, then S(u) = Au for all u and p is umbilic. 
So, suppose without loss of generality that A2 < A; and take a unit vector u 
which may be written as u = cos 4 u; + sin @ ug, where @ is the angle between 
u and u,;. We may now compute the normal curvature at u, denoting the 
dependence of u on @ by writing k(6). 


k(@) = S(u)-u 

= S(cos@u,; + sin@ ug) - (cos@ u; + sin 6 uz) 

= (cos @ S(u,) + sin @ S(uz)) - (cos @ u; + sin 8 ug) 

= cos? @ $(u;) - u; + sin cos @ S$(u;) - uz sin cos 6 S(ug) + uy 

+ sin? 6 S(ug) - ue 

= cos @ \; + sin? @ AQ. 
Now write k(@) = A1+(A2—A1) sin? 6 using sin? + cos? 9 = 1. Since Az < A1, 
k(@) is a maximum when 6 = 0; that is, when u = uy. Hence, ky = kmax(0) = 
k(uy) = AI. 

Similarly, we may write k(@) = (A; — Az) cos? @ + Azg and, since A2 < Aj, 


k(@) is a minimum when 0 = 4. That is, k(@) is a minimum when u = upg. 
Hence, kz = kmin(0) = k(u2) = Az. 0 


Corollary 4.4. For u = cos@u, +sin@ ug as above, the normal curvature is 
given by Euler’s formula 


k(u) = cos? 6 ky + sin? 6 ko. 
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2.5 PLOTTING SURFACES IN MAPLE 


At the end of Chapter 1, rather offhandedly, I wrote the MAPLE com- 
mands necessary to plot a sphere and a cylinder together in order to illustrate 
Viviani’s curve. In this short section, I want to describe how MAPLE plots 
surfaces in general. The basic command is “plot3d.” Let’s take Enneper’s 
surface 

x(u,v) = (u— u3/34 uv’, v — v3 /3 + vu", u? — v?). 


MAPLE input which may be used to plot Enneper’s surface is as follows. 

> with(plots): 

> Ennep:=[u-u~ 3/3+ux«v 2,v-v~3/3+v«u 2, u~2 - v~ 2]; 

> plot3d(Ennep,u=-1.5..1.5,v=-1.5..1.5,scaling=constrained, shading=zhue); 


The “shading” option tells MAPLE to use a particular color scheme for plot- 
ting the surface. Different schemes may be tried by clicking on “color” in the 
toolbar which appears when an object is graphed. On the Power PC, you 
should click on the upper left corner of the graphics window to get the menu 
of options. Note that, just as for “spacecurve,” the square brackets do not 
enclose the domain of the parameters. In the plot command above change 
the bounds of the parameters u and v to u=-1.9..1.9, v=-1.9..1.9. What is 
different? Now replace shading=zhue by shading=XY or shading=zgrayscale 
to change the color scheme at the command line instead of from the toolbar. 


EXERCISE 5.1. Graph the hyperboloid of one sheet according to the parametriza- 
tions given in Exercise 2.1.13. Experiment with different bounds on the parameters 
as well as plotting options. 


MAPLE may be used to conveniently create surfaces from particular data. 
For example, a ruled surface x(u, v) = G(u) + v6(u) may be defined by spec- 
ifying G and 6. 


> rulsurf:=proc(beta,delta) 
> local x1,x2,x3; 


> x1:=beta/1]+v*delta/1]; 
> x2:=beta|2]+v*delta[2]: 
> x3:=beta[3]+v*delta{3}; 
> [x1,x2,x3]; 

> end: 


A particular type of ruled surface is a tangent developable. Given a curve 
G(u), its tangent developable is defined to be the surface parametrized by 
x(u,v) = B(u)+v6'(u). We may create and graph the tangent developable of 
the helix (for instance) (uw) = (cos u,sinu, u). See Figure 3.1 for the result. 
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> bb:=[cos(u),sin(u),u]; dd:=diff(bb,u); 

> helixtandev:=rulsurf(bb,dd); 

> plot3d(tandev,u=0..6*Pi,v=0..6,scaling=constrained,grid=[40,6]); 
EXERCISE 5.2. Create a ruled surface with @(u) = (0,0,sin(ku)) and 6(u) = 


(cosu,sinu,0). Plot this ruled surface for varying k. What do you notice? See 
Figure 2.19 for the case k = 1. 


FIGURE 2.19 


EXERCISE 5.3. Write a MAPLE procedure to create a parametrization for a surface 
of revolution given a plane curve a(u) = (g(u),h(u)). Use your procedure to plot 
surfaces of revolution such as the catenoid and the torus. 


EXERCISE 5.4. Use the two procedures above to create ruled and revolution surface 
parametrizations for the hyperboloid of one sheet a? + y" = gS 1, 


EXERCISE 5.5. Write a procedure to create a Monge parametrization for any given 
input function. 


Chapter 3 
CURVATURE(S) 


3.1 INTRODUCTION 


As we have seen, objects such as the shape operator and the principal 
curvatures give a great deal of information to us about the geometry of a 
surface. We have also seen, however, that it may be difficult or impossible 
to actually compute these objects precisely. In this chapter we will introduce 
two computable “invariants” of a surface which are associated to the shape 
operator via linear algebra. In the next chapter we will then consider various 
results which indicate the kind of information these objects provide. 

The two most basic linear algebraic invariants associated to a linear trans- 
formation are its determinant and its trace. Because the shape operator at a 
point p is a linear transformation, we may define two geometric quantities in 
terms of the shape operator’s determinant and trace. 


Definition 1.1. 

The Gaussian curvature of a surface M at p € M is defined to be K(p) = 
det (Sp). 

The mean curvature of a surface M at p € M is defined to be H(p) = 
3 trace(Sp). 


These two quantities are fundamental to the study of the geometry of 
surfaces, as we shall see. Moreover, even though K and 4H are defined in 
terms of the shape operator, we will see that they may be calculated by 
calculus alone. Therefore, we are led back to our main premise that linear 
algebra is the bridge which allows geometry to be studied via calculus. We 
have seen that the matrix of the shape operator with respect to a basis of 
principal vectors is given by 


lens kato) | 


with determinant and trace equal to kjk2 and k; + kz respectively. Hence, 


_ ki tke 


K = kyko, H 5 
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Recall that if -U is chosen as a normal instead of U, then k(u) changes sign 
also. Since Gaussian curvature is the product of two such changes, it does not 
change sign. Note, however, that H does change sign under a change of unit 
normal. These observations are important because the sign of K has meaning. 

Suppose K(p) > 0. Since K = k,k2, ky and ke must have the same sign. 
But ki = maxk(u) and kz = mink(u), so k(u) has the same sign for all u. If 
k(u) > 0 for all u then M bends toward U in every direction. If k(u) < 0 for 
all u then M bends away from U in every direction. 


EXERCISE 1.1. Interpret K(p) < 0 and K(p) = 0. What are the pictures? 


Examples 1.2. 

1. For A(p) > 0, consider the elliptic paraboloid z = x? + y? at p = (0,0,0). 
The paraboloid is tangent to the zy-plane at p and, clearly, whichever vertical 
normal is chosen, either all k(u) are positive or all are negative. 

2. For K(p) < 0, consider the hyperbolic paraboloid z = x? — y? at p = 
(0,0,0). Again, the tangent plane at p is the ry-plane. This time, however, 
the surface both bends away and towards either chosen normal. 


EXERCISE 1.2. The case K(p) = 0 may arise in two ways exemplified by (1) a 
plane and (2) a cylinder a? + y" = 1. Discuss these in terms of normal curvature. 


Now we will focus on straightforward ways to compute curvature in terms 
of calculus and linear algebra. Let v and w be linearly independent tangent 
vectors at p € M. This means simply that one vector is not a multiple of the 
other. Since they are linearly independent, v and w form a basis for T,(M) 
and any vector is a linear combination of them. Hence, we may write the 
effects of the shape operator as 


S(v)=av+bw — and S(w) = cv + dw. 
This says that the matrix of S with respect to the basis {v, w} is 
aoc 
b d|- 


Now det(S) = ad — bc = K and tr(S) =a+d = 2H by definition, so 
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S(v) x S(w) = (av + bw) x (cv + dw) 
= ac(v x v) +ad(v x w) 
+ be(w x v) + bd(w x w) 
= 0+ (ad — be)(v x w) + 0 
= det(S)v x w 
=Kvxw 


S(v) x w+v x S(w) = (av + bw) x w+ v x (cv + dw) 
a(v x w) + d(v x w) 
(a+d)(v x w) 

= tr(S)v x w 

= 2Hvxw. 


EXERCISE 1.3 (Lagrange Identity). Show that for vectors v, w, a and b, 
(v x w)- (ax b) = (v-a)(w- b) — (v- b)(w- a). 


Hint: Write out the vectors in coordinates (v', v?, v*) = v etc. 


Now combine Exercise 1.3 with the formulas above to get, 
(S(v) x S(w))-(v x w) = K(v x w)-(v x w) 
(S(v) - v)(S(w) - w) — (S(v) - w)(S(w) - v) 
= K ((v-v)(w-w)—(v-w)(w-v)) 
K = (SY) _-v)(S(w) - w) ~ (S(v) -w)(S(w) -v) 
(v-v)(w-w) —(v-w)(w-v) 


and 
(S(v) x w+v x S(w))-(v x w) = 2H (v x w)- (v x w) 
(S(v) -v)(w- w) — (S(v) -w)(w- v) 
+(v-v)(S(w) + w) — (v- w)(S(w) - v) 
= 2H ((v-v)(w- w) —(v-w)(w- v)) 
Finally, by dividing through, we obtain 
H= 


(S(v) - v)(w - w) — (S(v) - w)(w- v) + (v- v)(S(w) - w) ~ (v- w)(S(w) - v) 
2((v -v)(w-w) — (v-w)(w-v)) | 


90 3. Curvature(s) 


EXERCISE 1.4. Use Euler’s formula (Corollary 2.4.4) to show 
(1) the mean curvature H at a point is the average normal curvature 
1 20 


H= > J (8) a6. 


(2) f= Kvi) + k(v2) for any two unit vectors v1 and v2 which are perpendicu- 
lar. Hint: If the angle from v, to uy is ¢, then the angle from v2 to uy is 


o+F. 


EXERCISE 1.5. Show that the principal curvatures are given in terms of K and H 
by 

ky =H+VH?—K and ky =H-WVH?-K. 
Hint: k, and kg are the eigenvalues of S. Find these for S = E | by setting the 


characteristic polynomial A? — tr(S)A +det(S) equal to zero and using the quadratic 
formula to solve. 


There is another more geometric way to view Gaussian curvature in terms 
of the Gauss map G: M — S?. We have seen in Chapter 2 that the derivative 
of the Gauss map is the negative of the shape operator, G,(v) = —S(v). Take 
the basis {x,,,x,} and consider 


G(Xu) X Ga(Xv) = (—S(Xu)) x (—S(xy)) 
ST Ky KR. 


Now, |G,.(Xu) x G.(xX,)| and |x,, x x,| may be thought of as infinitesimal pieces 
of area of the image of the Gauss map on S* and M respectively. The formula 
above then says that the ratio of these infinitesimal areas is precisely |K]. 
Another way to say this is the following. Let U be a small open neighborhood 
of p € M and suppose that U/ is contracting down to p. Then 


_,, AreaG(U) 
fe Asean 


This expression means that the magnitude of Gaussian curvature measures 
the way in which the unit normal expands or contracts area. The Gauss 
map will be of greater importance to us when we study the complex-analytic 
approach to minimal surfaces in Chapter 7. With this in mind, we introduce 
some convenient terminology. A surface M is said to be flat if K(p) = 0 for 
every p € M and it is said to be minimal if H(p) = 0 for every p € M. We 
will consider minimal surfaces extensively in two later chapters and we will 
see exactly where the term “minimal” comes from. The term “flat” is derived 
from the fact that the prime example of a flat surface is the plane. We have 
already seen that the shape operator of a plane is identically zero, so it is 
immediate that its determinant, the Gaussian curvature, is identically zero as 
well. It may not be so obvious however that other surfaces may be flat. 
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EXERCISE 1.6. Use your knowledge of the shape operator for the cylinder x” +y? = 
R? to show that a right circular cylinder is flat, but not minimal. 


EXERCISE 1.7. Show that if M is minimal, then K <0 on M. 


EXERCISE 1.8. Show that the R-sphere S7(R) has K = 1/R?, first by considering 
the determinant of the R-sphere’s shape operator and, secondly by considering the 
Gauss map of the R-sphere and how area changes. 


3.2 CALCULATING CURVATURE 


The formulae for A and H for general v and w may be particularized to 
xX, and x, when a patch x is given for M. With this in mind we introduce 
the following traditional notation: 


B= Xi *% ba 8 (Ky) Ry 
F=xXy-Xy m = S(Xu)-Xy = S(Xv) «Xu 
G = Xy °Xy n= S(x,) Xy 


Then, replacing the general v, w by x,,, X,, we have the two curvature for- 
mulas 


(S(Xu) *Xu)(9(Xv) - Xv) — (S (Xu) - Xy)(S (Xv) - Xu) 


ade (Xu : Xi} OG ; x5) al (x5 : 9am | a ; Xx) 
_ In-m? 
- EG — F? 


BU (au 2, oty Re) — Otago Moke ey )) 
_ Gl+ En —2Fm 
- 2(EG — F?) 


Remark 2.1. The quantity EG — F? has already made an appearance as 
[xu x X,|*. See Exercise 1.3.4 (Lagrange’s identity). 


Now we come to the fundamental result which allows for computation of 
Kk and H without reference to the shape operator S. The result follows 
immediately from Lemma 2.4.1, but we write out the proof here explicitly 
because it is so important. 
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Lemma 2.1: 0=U0 + Xiu, THE UX TSU Xe 


Proof. We shall prove the formula for m. We know U - x, = 0 since U is 
normal and x, € T,(M). As we have seen previously, 


Ox; 
= Au U . uv) = *u a 
O=x, (0 x) =x ee 
= Dd xul Fae at 2 teu Ea 
=Vx,0-x,»+U-Xuy since xy, E = ana 
= —S(Xy)-Xy +U - Xuy. 
Hence, S(x,y) + Xy =U -Xwy. O 


EXERCISE 2.1. Prove the formulas for | and n. 


EXERCISE 2.2. Suppose x(u,v) and y(r,s) are two patches for the same surface 
M. For y_! ox(u, v) = (r(u, v), s(u, v)), show that 


ExGx — F2 = [rusy — rvsul"(EyGy — F2) 


Ixnx — ma, = = [ruse = rysul? (lyny — my) 


and, consequently, Kx = Ky. 


EXERCISE 2.3. Suppose x(u,v) is a patch. Define a new patch by y(u,v) = 
cx(u,v), where c is a constant. Show that 


EXERCISE 2.4. From a surface M: x(u,v), construct a parallel surface M ue 
x'(u,v) = x(u,v) +tU(u,v). Show that the Gaussian and mean curvatures are 
given by the formulas 


kta K yte H — Kt 

1—2Ht+ Kt? 1—2Ht+ Kt? 

where K and H are the Gaussian and mean curvatures of M. Then show the 
following: (1) if M has constant mean curvature H = c, then M 3¢ has constant 
Gauss curvature K2¢ = 4c”, (2) if M has constant Gauss curvature K = c”, then 
M® has constant mean curvature He = =. Hint: compute E‘Gt — Ft? = Ix’ x 
x4,/? = [1 — 2Ht + Kt?|?(EG — F*) directly from the definition of x'(u,v) and 
compare. We will use parallel surfaces when we discuss minimal and constant mean 
curvature surfaces in Chapter 4. 
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EXERCISE 2.5. Let M: x(u,v) be a surface. Recall that a curve a: I —~ Misa 
line of curvature if the tangent vector a’(t) is an eigenvector of the shape operator 
for each ¢t. Show that all u and v-parameter curves are lines of curvature if and only 
if F =O and m= 0. 


Example 2.2: Enneper’s surface. Let M denote Enneper’s surface 
3 3 
x(u,v) = (u— = +uv?,u- > + vu’, u? — v?). 
Then, x, = (1—u*+v?, 2uv, 2u), xy = (2uv,1—v?+u?, —2v) and we compute 
E, F and G as 
BS Ky Ky 
= 1+ 2u? + 2u? + ut + 2u?v? + v4 
=(1+u? +07)’, 


BS Xie 
= Quv — 2u3v + 2uv?® + 2Quv — 2uv? + 2u3v — 4uv 
= 0, 

G = Xy + Xy 


= 4u?v? +1 —0? +? — 0? + 4 — 2? +? — uv? + uf? + 40? 
= 14 Qu? + Qu? + ut + 2u?v? + v4 
= (1+u? +)’. 
The unit normal U is found by taking 
Xy X Xy = (—4uv? — 2u+ Quv? — 2u3, 
Qu — 2u2v + Qu? + 4u7v, 
1 — (u? — v?)? — 4u?v?) 
= (-2u(1+ u? + v?), 2u(1 + u? + v?), 1 — (u? + v?)?) 
with length squared 
[xu x X_|? = 4u2(1 + u? + v7)? + 402(1 + u? + v?)? 
+ (u2 + v?)t — 2(u? +0)? 41 
= A(u? + v*) + 8(u? + v7)? + 4(u? + v?)3 
+ (u? +v?)4 — 2(u? +07)? 44.1 
= (u? +0")? + 4(u? + 0?)3 + 6(u? + v?)? + 4(u? + 0?) +1 
= (u*+v*+1)4 
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which also may be computed by noting the consequence of the Lagrange iden- 
tity, |x, x X,| = VEG — F?. Then, 

7 —2u 2u 1 — (u? + v?)? 

~ \ur+u241? wrto2+1? (w+? +12 /° 
The second partials Xy, = (—2u,2v,2), Xu = (2v,2u,0) and x, = 
(2u, —2v, —2) then give 

> copra oi 

Au? ‘ Ay? i 2 — 2(u? + v2)? 

utter. urtu2t 1 (u2+v2 41)? 

_, 2(u? +0? +1)? | 

~ (u2 +02 +1)? 

= 2. 


Similarly, n = xX,,-U = —2 and, clearly, m = xy,-U = 0. Then the Gauss 
curvature of Enneper’s surface is 


In — m? 
at EG — F? 
—4 


and the mean curvature is 


Gl+ En—2Fm 


= EG — F2) 


_ (u? + uv? + 1)?(2) + (u? + v? + 1)?(-2) -0 
7 2(u? + v2 + 1)4 
= 0. 


Hence, Enneper’s surface is a minimal surface. We will see exactly how its 
parametrization arises in Chapter 7. 


EXERCISE 2.6. Show that the u and v parameter curves of the catenoid and En- 
neper’s surface are lines of curvature. Also, show that these curves are planar (Hint: 
calculate torsion). Finally, show that the following surface also has parameter curves 
which are planar lines of curvature: 


cuxtsinucoshy v+ccosusinhv 
x(u, v) Pe eae ey ——————rY <5 yea 
V1 —c? Vi-c 
Hint: use MAPLE. Enneper’s surface, the catenoid and this last surface are the only 
minimal surfaces with planar lines of curvature. 


, +cosucosh v) : 
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2 


EXERCISE 2.7. Using K = ahs, show that the Gaussian curvature of the 
R-sphere S$7(R) is 1/R?. 


Example 2.3: The hyperboloid of two sheets. Let M denote the hy- 
perboloid of two sheets 


parametrized by x(u, v) = (asinh ucosv, bsinh usin v, ccosh u). Then, 


X, = (acosh ucos v, bcosh usin v, csinh wu) 


Xy = (—asinh usin v, bsinh ucos v, 0) 


and xy X Xy = (—be sinh? ucos v, —acsinh? sin v,absinh ucosh wu). Dividing by 
|Xu X Xy | gives 


y= Xy, X Xy 
W 
where W = \V/b2c? sinh’ u cos? v + a2c2 sinh’ sin? v + a2? sinh? u cosh? u. 


We then have 


2 3 ‘ 
E = a* cosh” ucos* v + b? cosh? usin? v + c? sinh? U, 
F = —a’ sinh ucosh usin v cos v + b’ sinh u cosh usin v cos v 


G = a’ sinh? usin? v + b? sinh? ucos? v 
with 


EG — F? = bc? sinh* ucos? v + a2c? sinh’ sin? v + a?b? sinh? ucosh? u 


=Ww?. 
The following second partials then give |, m and n. 


Xuu = (asinh ucos v, bsinh usin v, ccosh u) 
Xuy = (—acosh usin v, bcosh ucosv, 0) 


Xyy = (—asinh ucos v, —bsinh usin v, 0) 


96 3. Curvature(s) 


l= xu U 


—abcsinh? u cos? 


v — abcsinh? usin? v + abcsinh u cosh? u 
WwW 

abc sinh u : : 

————. using 1+ sinh? u = cosh? u, 


W 


N= Xiu 


abcsinh? ucosh usin v cosv — abcsinh? ucosh usin v cos uv 
WwW 


abcsinh® u cos? v + abc sinh? u sin? v 
WwW 
abc sinh? u 
WwW 


Hence, we obtain the Gauss curvature 


ae In—m? _ a®bc? sinh* u 
~ EG — F2 — w4 


which we may write as 


w2 \? 
K =1/ {| ———-- 
/ (——] 


where 


w2 C ac ab 
Se sinh” u cos? v + — sinh? usin? v + — cosh? w. 
abcsinh“u = @ b Cc 


Now, the coordinate functions of the parametrization are x = asinhucosv, 
y = bsinhusinv and z = ccoshu, so the reader can check that the Gaussian 
curvature may be written in terms of x, y and z as 
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EXERCISE 2.8. Show that the Gaussian curvature of the hyperboloid of one sheet 
x(u, v) = (acoshucosv, bcoshusinv, csinh u) 


may be written in Cartesian coordinates as 


Hint: MAPLE may be useful here. 


EXERCISE 2.9. Show that the Gaussian curvatures of the elliptic and hyperbolic 
paraboloids (respectively) 


2 2 2 2 
_ 2 y _ 2 y 
a ne + be and z= az _ b2 
may be written in cartesian coordinates (respectively) as 
1 
2 2 
4a2b2 E +44 1| 4a2b2 E +H+ 1| 


Hint: MAPLE may be useful here. 


EXERCISE 2.10. Find the Gaussian and mean curvatures of the helicoid parame- 
trized by x(u, v) = (ucos v, usin v, bv). 


EXERCISE 2.11. Find the Gaussian and mean curvatures for the saddle surface 
M: z= ay. Take a Monge patch x(u, v) = (u, v, uv). 


EXERCISE 2.12. Compute the Gaussian curvature of Kuen’s Surface x(u, v) = 


2(cosu+usinu)siny 2(sinu—ucosu)sinv 7) 2cos v 
ee ee en) 
1+ usin“ v 1+ u?sin*v 2° 1+ u*sin 


EXERCISE 2.13. 
(a) Show that a ruled surface x(u, v) = @(u) + v6(u) has Gaussian curvature 


_ -(p'-6x &)? 
= 7 
where W = |G’ x 6 + v6’ x 6|. Using ruling patches, 
(b) Compute K for M: z= zy. 
(c) Compute K for a cone x(u, v) = p+ v6(u). 
(d) Compute K for a cylinder x(u, v) = G(u) + vg. 
(e) Compute K for the helicoid. 
(f) Compute K for a hyperboloid of one sheet. 
(g) Compute K for the hyperbolic paraboloid. 
Interpret your computations in terms of the geometry of the surface. Compare your 
calculations with previous ones. 


Kk 
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EXERCISE 2.14. A cone (minus its vertex) is a surface. For the standard cone 
z= ./x2 + y?, compute K and H using the two patches: 
(a) x(u,v) = (u,v, Vu? + v2) (ie. a Monge patch). 


(b) x(u, v) = v(cosu, sin wu, 1) (ie. a ruling patch). 


EXERCISE 2.15. A ruled surface M: x(u,v) = B(u) + v6(u) is developable if its unit 
normal U is constant along the rulings. Hence, U does not depend on v. Show that 
a ruled surface is developable if and only if its Gaussian curvature is zero. See [BM] 
for an industrial application. 


EXERCISE 2.16. Show directly that cones and cylinders have constant normals 
along rulings and, therefore, are developable. 


EXERCISE 2.17. Besides cones and cylinders, typical examples of developable sur- 
faces are ones which arise from curves, the tangent developables. Given a curve 
B(u), its tangent developable is defined to be the surface parametrized by x(u, v) = 
B(u) + vB’ (u). Show that this surface is, in fact, developable. Graph the tangent 
developable of the helix G(u) = (cosu, sin u, u). 


FIGURE 3.1. Tangent developable of a helix 


EXERCISE 2.18. Let G: I — M be a curve on a surface M with unit normal 
U. Show that @ is a line of curvature on M if and only if the surface defined by 
y(u,v) = B(u) + vU(u) is developable. Here, U(u) denotes the normal of M along 


B. 


EXERCISE 2.19. For M: z = f(x,y) 
(a) Give a Monge patch for this surface. 
(b) Compute FE, F, G, l, m, n. 

(c) Find formulas for K and H. 
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(d) Recall that (uo, vo) is a critical point for f if fu(uo,vo) = 0 and fu(uo, vo) = 0. 
Now, (ug,vo) may be a maximum, a minimum or a saddle point. The gnd 
Derivative Test is designed to decide which of these choices pertains. It goes 
like this. First, compute D = fuu(uo, v0) fuv(uo, v0) — (fuv(uo,vo))*. If D = 0, 
then the test fails and no information is obtained. If D < 0, then (uo, vo) is a 
saddle point. If D > 0, then there are two cases: 
(I) If fuu(uo, vo) > 0, then (uo, vo) is’a minimum. 

(II) If fuu(uo, vo) < 0, then (uo, vo) is a maximum. 

(In the above, fyy could equally well be used.) 

Show that this test works! Hints: (1) How is K related to D? (2) Consider K > 
0, K <0, K =0. (3) What do these imply about normal and principal curvatures? 
(4) Recall k(v) = S(v) -v for v a unit vector. If w is not a unit vector, then it can 
be made one by w/|w|. Thus, 


— 


S(w) - w = (S(w/|w|)-w/|wl)iwl? Why? 
= k(w/|wl)|wl?. 


(5) fuu(uo, vo) = U(uo, v0) « Xuu(uo, vo). 


3.3 SURFACES OF REVOLUTION 


We have already seen that many interesting examples of surfaces arise from 
revolving curves about an axis. These surfaces of revolution have patches of 
the form x(u, v) = (g(u), h(u) cos v, h(u) sin v). 


FIGURE 3.2. A surface of revolution 


The curves on the surface which are circles formed by revolving a single 
point about the axis are called parallels (denoted 7). The curves which look 
exactly like the original curve (but rotated out) are called meridians (de- 
noted 1). Also, recall that g(u) is the distance along the axis of revolution 
and h(u) is the radius of a parallel. We have, x, = (g’,h’cosv,h'sinv), 
Xy = (0,—Asinv, hcosv) and x, X Xy = (hh’, —g’hcosv, —g/hsinv). Hence, 


(h’, —g’ cos v, —g’ sin v) 


C= 
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Also, the second partial derivatives are Xu, = (g”,h” cosv, h” sin v), 
Xuv = (0, —h’ sin v, h’ cos v), and xy» = (0, —hcosv, —hsinv). Hence, 


(gh! Pr. h’g') 


E=g +h? ~ al ghcene 
F=0 m=0 
G=h? fy ieee 


ab] gl’? + h’2 ; 
Finally, the Gaussian curvature is computed to be 


pe g' (gh! me hi'g') 
h(g’? + hi)? 


EXERCISE 3.1. Verify the computations above. 


If the original curve a(t) = (g(t), A(t),0) has g/(t) # 0 for all t, then g 
is strictly increasing. Thus, it is one-to-one and has an inverse function g~! 
which is differentiable as well. Such an inverse allows us to reparametrize the 
curve a. Define f = hog! and get 


G(u) = aog™*(u) = (gg~*(u), hg” *(u), 0) 


= (u, f(u), 0). 
Thus, our calculations become somewhat easier. For instance, the formula 
for Gaussian curvature becomes K = —Fal- To avoid confusion, we will 
still write a(u) = (u,h(u),0) and K = —py oa. 


Example 3.1: The Torus. 


FIGURE 3.3. A torus 


x(u,v) =((R+rcosu)cosv, (R+rcosu)sinv, rsinu) 
h(u) g(u) 
cos u 
= r(R+rcosu) 
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Observe that, for -5 <u < 4, we have cosu > 0. Hence, K > 0 on the outer 
half of the torus. For u= —3, 5 we have cosu = 0, so K = 0 on the top and 
bottom of the torus. Finally, for | <u< 3 we have cosu < 0 and K < 0 
on the inner half of the torus. Note that the maximum K occurs at u = 0, 

i! 


tO RF) 


on the outermost circle, while the minimum K occurs at u = 7, 


AO ene 


on the innermost circle. 
EXERCISE 3.2. Verify the formula for K above. 


EXERCISE 3.3. For a surface of revolution parametrized as above, the v-parameter 
curves are parallels and the u-parameter curves are meridians. The normal curva- 
tures in the directions of x» and xy are denoted kz and k, respectively. Show that 
ky = 1/E and kx = n/G by calculating the eigenvectors of S as xy and xy with 
eigenvalues |/E, n/G respectively. This shows directly that meridians and parallels 
are lines of curvature. Hint: S(xu) = axy + bxy. Use F =0, m=0. 


EXERCISE 3.4. For a general surface of revolution, show that 
Mpt ol pit / 
ku = 7 2 , ce 2 : 2 
(g! + h! )3/2 hig! + h! 1/2 
and obtain formulas for K and H. What is ky on a torus? What is the geometric 


reason for this? 


EXERCISE 3.5. Revolve the catenary y = ccosh(£) around the z-axis. Show the 
following: 


kn = —ky, k= 
Can you think of another surface of revolution which is minimal? 


EXERCISE 3.6. For the following surfaces of revolution, find K and describe where 
K>0,K <0,kK=0. 
2 
(a) Revolve a(u) = (u,e~% /?, 0) about the z-axis. 
2 
(b) Elliptical Torus. Revolve the ellipse Gn + ve = 1 about the z-axis. 


EXERCISE 3.7. If a surface of revolution is generated by a unit speed curve a(u) = 
(9(u), h(u), 0), then show (a) E=1, F=0, G=h? and (b) K = —h""/h. 
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Example 3.2. As another example of our computational techniques, we will 
consider a surface of revolution which is analogous to the sphere in the sense 
that it has constant Gaussian curvature. This surface is called the Pseudo- 
sphere and it is defined by means of a condition on the generating curve. 
Namely, let a be a curve which begins at (0,c) and traces a path so that its 
tangent line at any point reaches the z-axis after running a distance exactly 
equal to c. This means the curve a must decrease and flatten. The curve a is 
called a tractriz and is physically represented by the path a ship would follow 
when starting at (0,c) and is being pulled by a tugboat which moves along 
the z-axis. 


FIGURE 3.4. Tangent line segment of length c 


Write a(u) = (u, h(u)) and note h’ < 0 and g(u) = u. Now, a’(u) = (1, h’(u)) 
and a tangent line at (u, h(u)) has the equation 


l(t) = a(u) + ta’(u). 


The y-coordinate of I(t) is then h + th’, so for t = 0 we are at a(u) and for 
t = —h/h’ we are on the z-axis. We want the length of this segment to be c, 
so we have 


c= |a(u) — (a(u) - “a/(u)) 
= la" 


= V1+h?. 


Then c? = fr (1 pr) = by + h?. Solving for h’ and then taking another 
derivative, we get 
h h'c? 
pi a 2 ~ (2 — h2)3/2" 
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EXERCISE 3.8. Show that, for the surface of revolution obtained by revolving a 
about the x-axis, we obtain (a) ky, = h'/c (b) kx = —1/ch’ and (c) K = —1/c?. 
Note that the Gaussian curvature of the pseudosphere is a constant. How- 
ever, in contrast to a sphere’s constant positive curvature, the pseudosphere’s 
curvature is negative. For c = 1, we obtain a surface at each point of which 
the Gaussian curvature is —1. Later we will meet up with more geometrically 
natural surfaces of this kind. 


FIGURE 3.5. The Pseudosphere 


EXERCISE 3.9. Solve the following separable differential equation 


h 
_ . 
V1—h? 


in two ways to get (1) u = nj + i | ~— V1—h? and (2) h(u) = sechu, 
u = w-tanhw. For the first, turn the picture over so we revolve the tractrix about 
the z-axis and the pseudosphere (with c = 1) lies over the disk of radius 1. If we 
write things in cylindrical coordinates (r, 6, z) then the old wu becomes z and the old 
h becomes r. By substitution in the formula above, we get 


z=—-V1—r? —In(r) +In(1+ V1 —r?). 
Of course, rectangular coordinates may now be introduced by r = ve x2 +y2. Use 
this formula to graph the pseudosphere via MAPLE. For the second, we obtain a 
patch for the pseudosphere 
x(w,v) = (w — tanh w, sech wcosv, sech wsin v). 
Plot this patch with MAPLE also. Compare the two plots. Hints: For the first, 


use trigonometric substitution with h = cosw. For the second, use hyperbolic 
trigonometric substitution. 
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The pseudosphere is actually determined by the requirement of constant 
curvature and certain initial conditions. Suppose we start off by requiring that 
a surface of revolution M have constant Gauss curvature K = —1. We can 
suppose also that M is parametrized by x(u, v) = (g(u), h(u) cos v, h(u) sin v) 
with g'(u)? + h’(u)? = 1 (ie. the profile curve is unit speed). Then, as 
we have seen, the formula for Gauss curvature reduces to K = —h’’/h, so 
the requirement of constant curvature K = —1 gives us a linear differential 


equation to solve, 
hi =h. 


The solution to this differential equation is easy. We obtain 
h(u) = Ae“ + Be. 


Then, plugging h(u) into the the unit speed relation g'(u)? + h’(u)? = 1 and 
solving for g(u) gives 


g(u) = a V1— (Ae? — Be-*)? dt. 


In choosing A and B, new surfaces of constant Gauss curvature K = —1 are 
created. 


PORORSSSSSSSSSSSUESESE207777793 


FIGURE 3.6. A constant negative curvature surface of revolution 


EXERCISE 3.10. Let A = 1 and B = 0. Show that this surface of constant Gauss 
curvature K = ~1 is the pseudosphere with c = 1. Hint: let e’ = sechw. 


EXERCISE 3.11. Carry out the same procedure for the case of positive Gauss 
curvature K = +1. What should A and B be to get the unit sphere? See section 7. 


3.4 A FORMULA FOR GAUSSIAN CURVATURE 


So far, we have seen that Gaussian curvature tells us a great deal about the 
geometry of a surface. However, since we compute K using the unit normal 
U, it seems that the precise way in which we situate a surface in R? may 
well change K while leaving the essential geometry unaltered. This would 
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mean that Gaussian curvature would not be an invariant of the geometry of 
the surface. We shall now give a formula for K which obviates the use of 
U and shows that K depends only on E, F, and G, the so-called metric of 
the surface. In fact, although a more general formula exists (see Exercise 4.5 
below), we shall restrict ourselves to the case where F = x, +x, = 0. That 
is, we shall assume that the u and v-parameter curves always meet at right 
angles. In this case we prove 


Theorem 4.1. The Gauss curvature depends only on the metric E, F and 
G, 


0 easter 2 ie) +8 (a) 
2VEG \Ovu\ VEG] Ou\ VEG//— 

Here, we have used the notation 
aE = = (i, -X,) and G,= —G = F (xy -Xy). 
Of course, to prove the theorem, we will show that our usual formula K = 
Jn=m* reduces to the one above. Notice that the formula of the theorem does 
not depend on the particular patch used, but only on the metric coefficients 
E, G (and implicitly F). We will consider this a bit more carefully later 
when we discuss isometries of surfaces, but suffice it to say that, intuitively, 
a surface may be “bent” without stretching and still have the same Gaussian 
curvature. The prime example of this phenomenon is the usual right circular 
cylinder. If the cylinder is cut along a ruling it can be made to lie flat on a 
plane without stretching. This means that, locally, the cylinder and the plane 
have the same geometry. This further explains, geometrically, the vanishing 
of the cylinder’s Gaussian curvature. The dependence of K on the metric 
alone is known as Gauss’s Theorem Egregium. 

We point out two things. First, this result does not mean that two surfaces 
with the same Gaussian curvature are the same geometrically. Again, to make 
this precise we need the notion of isometry, so for the moment we give 


by 


EXERCISE 4.1. Show that the following two patches have the same Gaussian cur- 
vature: 


x(u, v) = (ucos v, usin v, v) y(u, v) = (wcosv, usin v, Inw). 


These surfaces are not isometric as will be shown later. 


Secondly, the Theorem Egregium holds for Gaussian curvature, but not for 
mean curvature. For instance, as we have mentioned, a cylinder and a plane 
are geometrically the same with the same Gauss curvature K = 0, but their 
mean curvatures are quite different. The mean curvature of the plane is zero 
while that of the cylinder is one-half the curvature of the base circle. Thus, 
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mean curvature is not truly an invariant of the surface itself, but depends on 
the way in which the surface sits in R°. 

We now turn to the derivation of the formula in Theorem 4.1. While 
the details of the calculations are quite tedious, keep in mind that we are 
simply finding coefficients for a vector in terms of a particular basis. Because 

= Xyy,°'U, Mm =Xy,-U and n = Xp, -U, we need expressions for Xyy, Xuv; 
and X,, in terms of the basis for 3-space {x,,Xy,U}. Write, 


Xuu =H XutliyxXe +lU (since xuu-U =/ etc.) 
(e*) Xuv = Di Xu + TepXe + mu 


Xyy = Typ Xu tT pypXe + nu. 
Our job is to find the I’s. These are just coefficients in a basis expansion, 


but they are known by the name Christoffel Symbols. We use what we know 
about dot products to determine the I’s. 


Xuu Xu = Ty Xu Xu +0+0 
=T%,E by definition of E 


If we can compute Xyx°X, then we will know 't.,,. This is simply the product 
rule. 


B= Xie Ri 80 By S Mig Ki Kee Kn = 2a Ke. 


Thus, 


E 
Kn Xe as and Wiau=s:- 
Further, x, +x, = 0 so taking the partial with respect to u gives 
O=Xyu'XytXyu°Xuy OF Xun? Xv = —Xu° Xuv- 


Also, E = Xy-X,, so taking the partial with respect to v gives By = 2Xy° Xu» 
and, consequently, EF, /2 = Xu -Xuy = —Xuu + Xv. Moreover, 


Te, = (XuuXv)/G=—-E,/2G and Vy, = Xu Xu/E = E,/2E. 


Continuing on, G = Xy + Xy, 80 G,/2 = Xuy + Xv. Then, since 0 = Xy - Xy, we 
have 


Np King = Keg ey, Swath: I, = Ki eG = Gu 26 
and [¥, = Xv: Xu/E = —G,/2E. 


3.4 A Formula for Gaussian Curvature 107 


Finally, Xy -Xy = G, 80 Xyy -Xy = Gy/2 and TY, = xyy -Xy/G = G,/2G. We 
end up with the following formulas. 


Ey, Ey 
Sc Toba _ Yeh +1U 
ee He eS at 
uv = ORY 9G m 
Xoy = Gus + Oey +nU 
vv 2 U IG UV 
l m 
Uy — ~ exe 2 Ge 
m n 
U, = ~ RX _ Gy 


EXERCISE 4.2. Compute Uy, and Uy as above. Hint: Uy % Vx,U = Axy + Bxv. 
Find A by Vx,U-xu = Axu:Xu = AE and 0 = xu(U-xu) = Vx, U-xutU -xuu.) 


We know that mixed partial derivatives are equal no matter the order of 
differentiation, sO Xyuy = Xuvu, OF Xuuy — Xuvy = 0. This means that the 
coefficients of x,,, X, and U are all zero when Xyyuy — Xuvy iS written in this 
basis. Let’s concentrate on the x,-term. (We again use the product rule 
repeatedly. ) 


2G 2G 
Now replace xy», Xyy and U, by their basis expansions to get, 


EuGy -_ Ey as EyGy as In x ze [ JU 
4EG 2G), 4G G\” 
Ey Gu 


= ot ef ae Rye Gu +m,U +mU 
Xuvu = iD Pe yokes IG ? v Gnu u MU y 
_ Ly, Ey ae Gu GuGu = m? x + [ JU 
4EG 2G), 4G2 G|” 
Because the x,-coefficient of Xu. — Xuvu is zero, we get, 
ms EyGu Ey E,G, | E, Ey Gu GuGy _In- m 
- 4EG 2G), 4G? 4EG \2G/, 4G? G 
Notice the last term! Dividing by E, we have 
In — m? aS. EvGu = f Ey _ E,Gy E,Ey ao Gu e- GyGy 
EG 4E°G E\2G/), 4EG?2 4E°G E\2G/), 4EG?’ 


Of course, the lefthand side is K (since F = 0) and the righthand side only 
depends on E and G. Thus, we have a formula for K which does not make 
explicit use of the normal U. 


EB EB E E. 


Ses se +| 


Xuvu = | bea + | 
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EXERCISE 4.3. Show that the righthand side above is 


“sen (8 (Bin) + (Go) 
2/EG \Ov\VEG/ Ou\VE 

EXERCISE 4.4. Compute the curvature of the sphere of radius R using this formula. 
Does this agree with what the shape operator tells you? 


EXERCISE 4.5. Show, by similar arguments, that, if F ~ 0, the following formula 


holds. 

K= 
; Se a an ee oe a oe 
(BG F3? Fy — 3 E F -|2 E F 
Ge F G & F G 


EXERCISE 4.6. Show that the U-terms of xyuy — Xuvu = 0 and xyvu — Xuvy = 0 
give 


Gu Eu 
ly ina =m (Se se) + BH 
= Ey Gu Eu Ey 
=o (3 su) + IG 
and 
E. G 
ny my =m (52 - $2) + Gu 
nae Gu Ey Gy Gu 
=f on T (3 5G) + 2G 


where H is mean curvature (and F' is assumed to be zero). These are the Codazzi- 
Mainardi equations. What do they say in the case where the surface is minimal (i.e. 
H = 0) and the parametrization satisfies E = G, F = 0? 


3.5 SOME EFFECTS OF CURVATURE(S) 


In this section we will derive some geometric consequences of various con- 
ditions on Gaussian and mean curvatures. Of course, the point of all this is 
simply to show that we have not wasted our time so far in learning how to 
compute K and H. These quantities really do tell us about the geometry of 
a surface. In the next chapter we will focus on mean curvature and see that 
it is intimately related to certain structures in Nature. For now, however, 
our goal is simply to see geometry reflected in the linear algebra and calculus 
represented by K and H. 
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Let us start by considering a completely impractical situation! Namely, 
suppose we knew that every point on a surface was an umbilic point. Recall 
that this means that, at every point, the principal curvatures are equal. Al- 
though this doesn’t mean that, a priori, the principal curvatures are the same 
at every point, this is precisely what we shall prove below. Of course, a direct 
verification of such a hypothesis is out of the question, yet we shall see later 
that this result is useful all the same. We shall use the following 


EXERCISE 5.1. Show that two vectors v = v1Xy + v2Xy + v3U and w = w1xu + 
w2Xy + w3U are equal if and only if the following conditions hold: 


VeXy=W'Xu, VeXy=w-X, v'-UV=w-lU. 
Hint: We do not assume xy: Xy = O here. Use EG= F? +0. 


EXERCISE 5.2. Suppose p € M is an umbilic point (i.e. kj = kg = k at p). Show 
that, at p, 


Hint: Write S(xu) = Axu + Bxy = kxz since p is umbilic. Compute B another 
way: 
l= S(xu)-xXu = AE + BF, m= S(Xu):Xy = AF + BG. 


Solve for B. Do the same for S(x,). 
Theorem 5.1. A surface M consisting entirely of umbilic points is contained 
in either a plane or a sphere. 


Proof. At each point 4 = ™ = = =C, but C may vary from point to point 
EF. G 


in M. Now, by Exercise 5.2 above, we can see that 
Vx,U =—-Cxy, and Vx,U = —-Cxy. 


For example, Vx,U-Xy = —l = -l/Ex, + xu = —l/EE and Vx,U +x, = 


—m = —m/FF = —m/Fx,,: X,. Now the mixed covariant derivative may be 
calculated. 
O7u, Ox; Ox; | Our 0° 2; 
View Vaxu = » dX es du Ov we Oz; Oudv )ex 
SV Vey 


so differentiating above by v (left) and u (right), we get, -—Cyxy, — CXyy = 
—CyXy — CXyy or CyXy, = CyxX,. But x, and x, are linearly independent, so 
we must have C, = 0 = C;,, and this implies that C is a constant. 


Case 1. Suppose C = 0. Then the shape operator is zero everywhere. By 
Theorem 2.2.3, M is contained in a plane. 
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Case 2. Suppose C' # 0. Then consider x(u, v) + 1U and compute, 


so Tina 2 aU) = Xy + =U 


Ou C 
=Xy+ al-Cxal by above 
= Xy — Xy 
= 0. 


In a similar fashion ; : 
— —U|=0. 
[x(u,v) + GU] 


Ov 
Hence, x(u,v) + 4U =p constant. Then |x(u,v) — p| = | — 4| = 4, so every 
ie kara l= 3G 
point x(u,v) lies a distance 1/C from p. Hence, x(u,v) lies on a sphere of 
radius 1/C. 


We note that we have proved the result only for a single patch. However, 
because M is connected, any patch must overlap some other patch. For a 
patch which overlaps x(u,v), the points in the overlap lie on the plane or 
sphere as above. Since the argument above produces a constant C’ on the new 
patch, it must, therefore, agree with the C of x(u,v). Hence, the geometry 
of x(u,v) propagates to overlapping patches. This then continues over the 
whole surface, each patch reproducing the geometry of its neighbors. OC 


Notice that in the situation above, 


_in-m? _4£5(42GC)-BP plEG- PF) PB 
EG — F? EG — F? EG—F? ~ E?’ 


so C = VK and the radius of the sphere is 1/\/K. The next result restricts the 
form surfaces in R? may take in terms of their possible Gaussian curvatures. 
Recall that a surface in R® is compact if it is closed and bounded. Here, 
“bounded” means that the surface may be enclosed by a sphere of sufficient 
size. The term “closed” means that any sequence of points on the surface 
which converges to some point in R® actually converges to a point on the 
surface. In the following result the hypothesis of “compactness” is used to 
ensure that maximums and minimums of functions are actually attained by 
the functions. 


Theorem 5.2. On every compact surface M C R?° there is some point p with 
K(p) > 0. 


Proof. Let f: M — R be defined by f(p) = |p|?. This function is continuous, 
so (since M is compact) f attains its max and its min. Let po be a max for 
f. Now f(po) = |po|?, so po is the farthest point from the origin on M. If 
r = |po|, then M is inside the sphere of radius r as shown: 
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FIGURE 3.7 


Therefore, it seems likely that K(po) > +, the curvature of the r-sphere 
S?(r). We verify this and so prove the theorem. Consider a unit tangent 
vector u € T,,(M) and take a curve a in M with a(0) = po and a’(0) = u. 
Of course the composition f o a still has a max at po (since a lies in M), so 


d ; ; do 
ahd ©@) |rxo = 0 since a max is a critical point 


—z(f oa) |r=o < 0 since po is a max. 


Now f ca(t) = Ja(t)|? = a(t) -a(t), so 4(f oa) = 2a-a’. At t=0, 
0 = 2a(0) - a’(0) = 2p - u. 
But u is any unit vector in T,,M, so p)-u=0 for all u € T,,M. Hence, po 
is normal to M at po. Also, £(f oa) = 2a’- a’ +2a-a” and att = 0, 
2 


is 
Oe Ge 


(f oa) = 2a’(0) - a’(0) + 2a(0) - a’”(0) 
2 u-u+ po: a’ (0) 
>1+po-a"(0) 
—1 > po-a"(0). 
Now, let’s calculate normal curvatures at po. Since po/r = po/|po| is a unit 
normal to M at po, the calculations above give 
k(u) = S(u)-u 
= S(a’)-a’ 
= U(po) - a" 


— PO 
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In particular, both k, and kp are less than or equal to —1/r. Thus, the Gauss 
curvature has 1 


11 
= >-- = ’ 
K (po) ki (po) k2(po) = 75 7) >0 
O 


Corollary 5.3. There are no compact surfaces in R? with K < 0. In partic- 
ular, no minimal surface embedded in R* is compact. 


We now come to one of the main results displaying the influence of curva- 
ture on geometry. Previously, we saw that an all umbilic surface is either a 
plane (K = 0) or a sphere (K > 0 and constant). There is, however, a much 
stronger result which is based on the Gauss curvature K itself rather than its 
factors ky and ko. This beautiful result is 


Theorem 5.4 (H. Liebmann). If M is a compact surface of constant Gauss- 
ian curvature K, then M is a sphere of radius 1//K. 


Proof. By the previous Theorem, there is some p € M with K(p) > 0. Since 
K is constant, then K > 0 everywhere on M. Because K is constant and 
K = ky, ko, where k; and kp are the principal curvatures, if p is a point where 
the function k, is a maximum, then p is also a point where kg is a minimum. 
We know that such a p exists since M is compact and k,, ke are continuous 
functions. We now have two cases to consider. 

Case 1. Suppose ki = ko at p. Because ky is a maximum and kp is 
a minimum at p, all normal curvatures on M must lie between these two 
values. Therefore, if k; = ko at p, the normal curvature k must be constant 
on M. Hence, M is all umbilic with K > 0, so it is a sphere of radius 1/ VK. 
(This is what we meant by saying that Theorem 5.1 would prove useful.) We 
shall now show that Case 2 below is impossible, so that Case 1 proves the 
theorem. 

Case 2. Suppose k; > kz at p. There is actually a small open neighborhood 
of p in M where k, > ko. If not, we could take a decreasing sequence of open 
“balls” about p and choose points a; in the balls with ki(a;) = ke(a;) and 
a; — p. Then, by continuity, we would have k,(p) = ko(p). This contradicts 
our assumption, so we must have the desired neighborhood. Although we shall 
not prove it, on this neighborhood we may choose two principal directions 
which are perpendicular at each point and use these to provide a patch with 
Xy'Xy = BE, xX, Xy = 0, Xy-Xy = G and, since x,, Xy are principal vectors, 


S(x,) = kiXu, S(x,) = koXy. 


(A proof of the existence of such a patch x may be found in [DoC1, p. 185}.) 
Note that m = S(x,)-Xy = kx, +X = 0 also. Moreover, 1 = S(Xu)-Xu = 
kiX, + Xy = ky E, so ky = + and, similarly, kg = &. Of course, we have the 
formula, 


« =— 5750 Le (aa) * oe (Vets) 
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and from the U-term of Xyu» — Xuvu = 0 (see Exercise 4.6), we get 
—nEy E,l 
ag 1" aE 
Bit n 
aaar) ls : a | 


Ey 
= “5 (ht + ke). 


Similarly, using the U-term of Xyyy — Xvuy = 0 we get, 


G, | l n G., 
tu => E as 4 = “> (kt + k2)., 


If we differentiate the formulas k,; = 4 and kp = ra repeatedly and use the 


expressions for |, and n, we get, 


1 E, 
hips Fa (Ele -—-E,l)= op (ke — ky) 
1 Gu 
kou = Ga (Gnu = G,,n) = yeu al k2) 
E 
kivy = on (t = ky) +e Ey (. os ) 
kouu = Guu (, — ke) + Gu(..-) 
2uu — IG 1 2 Uulewe de 
Now, at p, k; is a max and kg is a min, so 
Kivy = 0 kouu > 0. 
Plugging these estimates into the equations above we get, at p, 
E, =0 G, = 0 
Evy = 0 Gas > 0. 


Note that, even though E, = 0 = G,, at p, the expressions 


2 (in) ot 2 (Se) 

Ov EG Ou EG 
may be nonzero at p since the differentiation is done before the evaluation at 
p. These expressions then give us, at p, 


1 
K = -—— (E wi oe OS 
SEG (Bow + Gua) S 
But this contradicts our hypothesis that K > 0, so our assumption k, > ke 
must be incorrect. Hence Case 1 holds and M is a sphere. 0 


Notice that in the course of the proof we have discovered a Lemma of David 
Hilbert which will prove useful in a later exercise. 
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Lemma 5.5 (Hilbert). If k; has a max at p, kp has a min at p and k, is 
strictly greater than ko at p, then K(p) < 0. 

So far, we have concentrated on the effects of Gaussian curvature on geom- 
etry. We now balance this somewhat by giving a result which indicates the 
effect mean curvature has on the geometry of a surface. Recall that a surface 
is minimal if the mean curvature is zero at each point of the surface. Just as 
for other geometric qualities, we may test the notion of minimality on classes 
of surfaces which we understand best. In Chapter 4 we shall consider other 
types of minimal surfaces, but for now let’s look at the surfaces of revolution. 


Theorem 5.6. If a surface of revolution M is minimal, then M is contained 
in either a plane or a catenoid. 


Proof. For simplicity, we will take the special case of a patch x(u,v) = 
(u, h(u)cosv, h(u)sinv) for M. Then 


ya. kuthke 1 SRE ao 1 _ 1 f-hh"+14h? 
2 2K + h?)8/2 "A + h)2 7 2) a(t ni?y3/2 J 
The surface M is minimal, so H = 0 and, thus, hh” =1+h’?. Let w =h’. 


oe _ dwdh — dw 
~ dhdu dh” 
Here we consider w as a function of h on an interval with h’(u) # 0. We 
can do this because, on such an interval, h has an inverse function f with 
u = f(h). Then, taking a derivative and applying the chain rule gives 
dfdh_ df 
+= Ghdu ~ dh” 


hh” _ wy’ 


or w=, 


si + 


so that w is a function of h since f and df /dh are. Thus, hh” = 1+h’? implies 
hw dw/dh = 1+ w?; or Tear dw = +dh. We can integrate both sides to get, 


In(V¥1+w?)=Inhk+c 
V1+w?=ch 
w= Vcth? ~1. 


Now, w = gh sO Saaazah = du and integration yields (with ch = cosh!) 
= =u+D 
~ cosh ch=u+D 
ch = cosh(cu + D) 
fi 1 


Cc 


cosh(cu + D). 
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Therefore, M is part of a catenoid. ) 


EXERCISE 5.3. In the proof above we assumed that h’(u) # 0. What happens if 
h'(u) = 0? Explain. 


EXERCISE 5.4. Show that a flat (K = 0) surface of revolution is part of a cone or 
a cylinder. Hint: Model your proof after the theorem above. 


EXERCISE 5.5. Show that if M is a compact, connected (oriented) surface with 
kK > 0 and H constant, then M is a sphere of radius Tk: 

Hints: (1) Consider the function H? — K = (k, — kz)*/4 and let p € M be a max 
for it. (2) If H? — K(p) = 0, show K is constant and use Liebmann’s Theorem. (3) 
Suppose H? — K(p) > 0 (so ki(p) > ke(p)). Note (k, — k2)(p) is a max. (4) Use 
Hilbert’s Lemma. 


3.6 SURFACES OF DELAUNAY 


Just as we saw that surfaces of revolution of constant Gaussian curvature 
are determined by certain integrals with chosen initial conditions, we can ask 
whether something similar occurs for constant mean curvature. Let’s take 
a surface of revolution M: x(u,v), where the patch has the form x(u,v) = 
(u, h(u) cosv, h(u)sinv). The mean curvature is given by 


_ 1 AR” +14+h" 
2 AL + h’?)3/2 * 


Suppose H = c/2 is constant. We then have the differential equation 
14h”? —hh” =ch(1+h”)?. 


First, consider the case when c = 0. The differential equation then reduces to 
1+h’*—hh” =0. This is precisely the equation we solved to get the catenoid. 
Of course this must be the case since the condition c = 0 is saying that M is a 
surface of revolution with zero mean curvature. Secondly, suppose c = +1/a 
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with a > 0. We get (for c =—1/a),1+h’? — hh” = —h(l+ h!?)2, or 


a(1 + h’?) — ahh” 


ie re 
opt | okt h!?) — ahh!" el esis 
(1+h!?)2 - 
2Qah!(1 +h’) — 2ahh’h”" 
a 4+ 2hh’ =0 
2 
d 2ah 9 
du te a =e 
h? + ess = +b? a constant. 


Jith? | 


If we take all cases into consideration (and note that the steps above may be 
reversed), we obtain the general form of the differential equation describing 
surfaces of revolution of constant mean curvature. 


Theorem 6.1. M is a surface of revolution of constant mean curvature pa- 
rametrized by x(u,v) = (u,h(u) cosv, h(u) sinv) if and only if the function 
h(u) satisfies 

2ah 


Vi+h? 


h?2+ = +b? 


where a and b are constants. 


It is an amazing fact discovered by Delaunay that this differential equation 
arises geometrically. That is to say, there is a geometric construction which 
produces the differential equation above and, consequently, all surfaces of 
revolution of constant mean curvature. We shall consider one example below 
in detail and then simply give the geometric characterization in a theorem. 


Example 6.1: The Roulette of an Ellipse. 

Suppose that an ellipse rolls without slipping along the z-axis. If we follow 
the path of one of the foci of the ellipse as it rolls, then this path gives an 
example of a roulette — a curve formed by a point associated to one curve 
as it rolls upon another. Consider the following “rolling” diagram where F’ 
and F” are the foci of the ellipse, K is the point of (tangent) contact with the 
x-axis and F'T' is the tangent to the curve traced out by F’. 

We will use various properties of the ellipse which were, perhaps, better 
known in years past (see (G], [Ya] or [Zw] for example). Let the ellipse have 
major axis of length a and minor axis of length b. Then it is known that the 
sum of the distances from the foci to a given point on the ellipse is constant 
and equal to 2a. In particular, FK + F’K = 2a. Furthermore, the ellipse has 
the beautiful (and useful) reflection property that any light ray originating 
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a Pe Py 


FIGURE 3.8. Ellipse rolling on the z-axis 


from one focus will reflect off the ellipse to the other focus. Since the angles of 


incidence and reflection are equal, this means that 6 def VF KP = ZF'KP". 
A more esoteric property of the ellipse (its pedal equation) is the fact that the 
product of the lengths of the line segments from the foci to any tangent line 
which meet the tangent perpendicularly is a constant equal to the square of 
the minor axis. In our case this means that FP- F’P’ = b?. Finally, a general 
property of this sort of roulette is the fact that the direction of the normal 
to the curve traced by F' passes through the point of contact with the line. 
Here, this means that F'K is perpendicular to FT. 

Now, the x and y coordinates of F' are given by OP and FP respectively. 
Then y = FK sin@. Because ZK FT = 7/2, we have ZFTP = 7/2 — 0. 
Moreover, ZFT'P is the angle between the tangent of the traced curve and 


the x-axis. That is, for the unit tangent of the traced curve T = (<, %), 
T dx 
in = (= - 9] a 4 Oy. 
sin # = cos | 5 (1, 0) 7 


Hence, y = FK #. Also, since ZFKP = ZF’KP’, we have y! @ F’P! = 
F'K &. Note that 


d 
yty' = (FK + F'K)— = 2a — 
by the fundamental property of the ellipse. Also, the pedal equation gives 
yy’ = b*. 
Solving for y’ in the second equation and plugging into the first equation gives 


dx 
2 2 
2ay — = 0. 
y ay ae 0 
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Now, the arclength is given by s = [ JVi+ty” dz, so a = L/AT eye: 
Replacing ae in the equation above and noting that its sign changes when 
ZFTK is large enough, we obtain 


2ay 


Vite 


This is precisely the equation we obtained from the assumption of constant 
mean curvature. Therefore, the roulette of an ellipse, an undulary, is a profile 
curve (i.e. meridian) of a surface of revolution of constant mean curvature, an 
unduloid. We note first that, for this case, we must have a > b. Also, we note 


that the companion equation y? + —% — b? = 0 is the differential equation 
p q y JVity® | 


describing the roulette of (the focus of) a hyperbola, a nodary. 


y? + >" = 0. 


FIGURE 3.9. The unduloid with a=1.5 and b=1 


EXERCISE 6.1. What is the roulette obtained when b = a? Hint: think geometri- 
cally. 


EXERCISE 6.2. What is the roulette obtained when b = 0. Hint: solve the differ- 
ential equation and then interpret geometrically. 


EXERCISE 6.3. Find the roulette of the focus of a parabola as it is rolled along the 
Z-axis. 


EXERCISE 6.4. Show that the differential equation y+ 2ay/4/1+ yl? +b7 =0 
may be put in the form 


dz = y? 


dy ./da2y2 — (y2 +62) 


This expression may be used with MAPLE to graph surfaces of Delaunay. 
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A surface M is immersed in R?° if there is a mapping f: M — R® which 
may not be one-to-one, but which has a one-to-one derivative map f, at every 
point. Also, although we shall treat the idea of completeness in Chapter V, 
we mention here that one property of a complete surface is that it is not part 
of some larger surface. Finally, recall that the word “conic” refers to any of 
circles, lines, parabolas, ellipses and hyperbolas. Now we can state Delaunay’s 
Theorem. 


Theorem 6.2 (Delaunay). A complete immersed surface of revolution of 
constant mean curvature is a roulette of a conic. 


More comprehensive discussions of the surfaces of Delaunay may be found 
in [Eel], [Ee2] (which we have mostly followed above) and, of course, [Dell]. 
When we talk more about surfaces with constant mean curvature in Chapter 
4, we shall see that they come about as the result of a variational principle 
(again, see [Eel], [Ee2]). That is, informally, the surface takes its shape to 
“minimize” surface area subject to having fixed volume. This principle has 
both physical and biological consequences. In particular, if a soap bubble is 
formed between two rings (with the ends included), then it holds a certain 
amount of air inside. If the amount of air is just right, then the bubble is 
a cylinder, the surface of Delaunay associated to the roulette of a circle. If 
the amount of air is changed or if the rings are pulled apart to lengthen the 
bubble, then the waist of the bubble narrows and an unduloid results. These 
types of experiments are discussed in D’Arcy Wentworth Thompson’s book 
On Growth and Form [DWT]. In Chapter 5 of this classic text Thompson 
describes informally how surface tension and pressure combine to determine 
the shapes of various one-celled creatures — shapes suspiciously Delaunayan! 


3.7 CALCULATING CURVATURE WITH MAPLE 


As might be expected, MAPLE may be used to calculate Gaussian and 
mean curvatures as well as such necessary ingredients as the metric coefficients 
E, F and G, the quantities (sometimes referred to as the coefficients of the 
second fundamental form) 1, m and n and unit normals. The procedures which 
carry out these calculations are listed below. Of course, for plotting surfaces, 
we should always start with 


> with(plots): 


Also, just as for curves, we can use the procedures for dot and cross prod- 
ucts and lengths. 


> dp := proc(X,Y) 


> X{1]*¥[1]+X[2]*¥[2]+X([3]*Y [3]; 
> end: 
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> nrm := proc(X) 
> —— sqrt(dp(X,X)); 
> end: 


> xp := proc(X,Y) 

> local a,b,c; 

> a := X(2]«Y[3]-X/3]*«Y [2]; 
> b= X[3}*¥[1]-X(1]*¥(3); 
> c := X([1]*Y(2]-X(2]*Y [1]; 
> [a,b,c]; 
> end: 


We also need to compute the Jacobian matrix whose columns comprise the 
tangent vectors to the parameter curves. 
> Jacf := proc(X) 
local Xu,Xv; 
Xu := [diff(X{1],u) diff(X[2],u),diff(X[3},u)); 
Xv := [diff(X[1],v) ,diff(X[2],v) diff(X[3],v)]; 
simplify ((Xu,Xv]); 
end: 


VVVVV 


The following procedures are then self-explanatory once you realize that 
each calls another procedure (i.e. Jacf or UN) to provide the blocks upon 
which it will build. For example, in EFG below, the very definition FE = x,-x, 
requires that Jacf(X) be called to provide x,. Similarly, EFG calls the dot 
product procedure dp to carry out the operations on the building blocks. Also, 
note the form of the output of EFG (for example) is a vector because, later 
on, we may want to only consider one of the metric coefficients and it is easy 
to write EFG[1] for E, EFG[2] for F and EFG[3] for G. 


> EFG := proc(X) 

local E,F,G,Y; 

Y := Jacf(X); 

E := dp(¥(1), Y{1)) 
F = dp(Y(1],¥[2)); 
G = dp(¥[2,Y[2)); 
simplify ([E,F,G]); 
end: 

:= proc(X) 

local Y,Z,s; 

Y := Jacf(X)); 

Zi= xp(Y(1],Y[2]); 
s := nrm(Z); 
PDE ela eee eree/ el) 
end: 
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Imn := proc(X) 


> 
> local Xu,Xv,Xuu,Xuv,Xvv,U,1],m,n; 

> Xu := [diff(X{1],u) diff(X[2],u) diff(X(3],u)}; 

> Xv := |diff(X{1],v) ,diff(X[2],v) ,diff(X[3],v))]; 

> Xuu := [diff(Xu{1],u),diff(Xu[2],u) ,diff(Xu[3],u)]; 
> Xuv := (diff(Xu[1],v) diff(Xu[2],v) diff(Xul3],v)}; 
> Xvv := (diff(Xv{1],v) ,diff(Xv[2],v) ,diff(Xv(3],v)]; 
> 

> 

> 

> 

> 

> 


U := UN(X); 

] := dp(U,Xuu); 

m := dp(U,Xuv); 
n := dp(U,Xvv); 
simplify ((l,m,n]); 
end: 
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Now it’s a relatively simple matter to put together procedures to calculate 


Gauss curvature and mean curvature. 


> GK := proc(X) 


> local E,F,G,1,m,n,S,T; 
> S := EFG(X); 

> T := lmn(X); 

> E := S(1); 

> F := §(2); 

> G := S{3}; 

> les Til); 

> m := T[2]; 

> n := T([3}; 

> simplify ((l#n-m~ 2) /(E*G-F~ 2)); 
> end: 

> MK := proc(X) 

> local E,F,G,l,m,n,S,T; 
> S := EFG(X); 

> T := Imn(X); 

> E := S(1); 

> F := §(2]; 

> G := §[3); 

> Ga WA 

> m := T[2}; 

> n := T[3}; 

> simplify ((G+l+Ex«n-2*F*m) /(2*E*G-2*F~2)); 
> end: 
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EXERCISE 7.1. Write a MAPLE procedure to calculate Gauss curvature (when 
F = 0) from the formula 


«= 5796 (5 (aa) + oe (yes) 


Here’s a procedure to plot the Gauss curvature of a surface on a given 
domain (which could be, say, the parameter domain of the surface). Note 
that, since dom|1] etc. are used in the procedure, MAPLE is expecting the 
domain to be given as a vector. 


> Gaussplot := proc(X,dom) 

> local k; 

> k:= GK(X); 

> plot3d(k,u = dom[1] .. dom[2],v = dom{3] .. dom/4], axes = frame, 
title = ‘Gaussian Curvature‘); 

> end: 


For example, if I want to plot the Gauss curvature function for the sphere of 
radius R, I would first enter 


> Rsphere:=[{Rxcos(u)*cos(v),R*sin(u)xcos(v),Rxsin(v)]; 
and then 
> Gaussplot(Rsphere,|0,2*Pi,-Pi/2,Pi/2]); 


and the output will be a three dimensional graph of the curvature function 
over the rectangular domain u = 0 to 27 and v = —7/2 to 7/2. Of course, 
what should this plot turn out to be? 


EXERCISE 7.2. Write a MAPLE procedure to plot the mean curvature of a surface. 


EXERCISE 7.3. Calculate Gauss and mean curvatures for the usual list of surfaces: 
helicoid, catenoid, torus, Enneper and hyperboloid(s). Also calculate for the pretzel 


surface 
x(u, v) = (cos(u) cos(v), sin(u) cos(v), sin(v) + wu). 


With some complicated surfaces such as Henneberg’s and Catalan’s surfaces 
(which we shall meet later), you must be careful when asking for curvature 
calculations. Always save your worksheet before asking for a complicated cal- 
culation. In particular, Henneberg will either give a mess which MAPLE has 
difficulty simplifying or simply crash. This probably depends on how much 
RAM is available. It is also possible to get a picture of Gauss curvature on 
the surface itself by coloring points with similar Gauss curvature by the same 
color. For example, the following colors the helicoid according to its Gauss 
curvature via the option “color=GK (helicoid).” 
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> helicoid:={uxcos(v),uxsin(v),v]; 


> plot3d(helicoid,u=0..1.5,v=0..5*Pi,color=GK (helicoid),style= 
patchnogrid,grid=(10,40}); 


EXERCISE 7.4: Color Enneper’s surface according to its Gauss curvature. What 
happens if you color according to mean curvature? 


As a less computational subject amenable to MAPLE, consider the surfaces 
of revolution of constant Gauss curvature. In Exercise 3.11, you were asked 
to determine the integral formula for g(u) when h(w) satisfies h” = —h. Of 
course, the answer to this question is that (with suitable shifting) 


h(w) = ccos(u) and =. g(u) = a \/ 1 — c2 sin? (t) dt. 


Notice that the second expression will only make sense when the squareroot 
is defined; that is, when u < arcsin(1/c). Here is a MAPLE procedure which 
graphs surfaces of constant positive Gauss curvature given an initial c and n. 
The n is just a parameter which allows a finer picture. That is, the larger n 
is, the more filled in the picture will be and the longer it will take to render. 


> with(plots): 


> plotconstcurv:=proc(c,n) 


> local X, list X,k,j,eval,desys,dequ,deqgu ; 

> X:=[h(u)*cos(v),h(u)*sin(v),g(u)]; 

> desys:=dsolve( {diff(g(u),u)=sqrt(1-c~ 2*sin(u)~2), g(0)=0},g(u), 

> type=numeric, output=listprocedure): 

> deqgu:=subs(desys,g(u)); 

> dequ:=subs(desys,u); 

> 

> eval:=proc(t,m, Y,du,dg;s) 

> local i; 

> seq(evalf(subs({h(u)=s*cos(du(t/m)), 
g(u)=dg(t/m),v=i/m},Y)),i=0..m*6.283): 

> end: 

> 

> if c<=1 then k:=nxevalf(Pi/2) else k:=nxevalf(arcsin(1/c)) fi; 

> listX:={seq(eval(j,n,X,dequ,deqgu,c),j=-k+0.00001..k-0.00001)}; 

> pointplot(listX,color=red,scaling=constrained); 

> end: 


We find g(u) above by converting the integral for g(w) into a differential 
equation by differentiating and using the Fundamental Theorem of Calculus. 
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MAPLE’s “dsolve” command then solves the differential equation numeri- 
cally and the option “output=listprocedure” allows us to keep track of the 
u’s and g(u)’s so that they may be inserted into other procedures. That’s 
exactly what the little embedded procedure “eval” does. Namely, when 
we do “listX” further on in the procedure, it creates a sequence of points 
which are the solutions of the differential equation evaluated in the param- 
etrization for a surface of revolution. In other words, the command listX:= 
{seq(eval(j,n,X,dequ,deqgu,c), j=-k+0.00001..k-0.00001)} takes the differen- 
tial equation solutions “dequ” and “deqgu” and shoves them into the param- 
etrization X. The “k” is the limit of integration arcsin(1/c) and 7 is just a 
parameter going between —k and k. The +0.00001 tacked on to & is just a 
way to avoid singularities in plotting, so there is nothing special about the 
number itself — feel free to change it. Finally, notice that I had to go through 
two evaluations; one to create the curve by solving the differential equation 
numerically and the other to revolve the curve around the axis. Then, once 
the sequence(s) of points are generated, the MAPLE command “pointplot” 
may be invoked to plot all the points in the sequences. This is a crazy way 
to do all this and once MAPLE’s elliptic integral package is improved, it will 
not be necessary. For then, the package will create the curve and you will 
only have to revolve it. In Chapter 5, geodesics will be drawn on surfaces 
using this simpler approach. Also, see [Gr] for the MATHEMATICA elliptic 
integral approach to constant curvature surfaces. 


EXERCISE 7.5. Try the plots plotconstcurv(1,10), plotconstcurv(0.5,15), 
plotconstcurv(0.95,12) and plotconstcurv(2,15). 
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FIGURE 3.10. Constant positive curvature surface of revolution 


EXERCISE 7.6. Write a MAPLE procedure to draw constant negative curvature 
surfaces of revolution. You may use the parameters A and B as in the text or 
take cases where h(u) = ccosh(u) or h(u) = csinh(w). Remember though that the 
squareroot must be defined. 

EXERCISE 7.7. Write a MAPLE procedure to graph surfaces of Delaunay. This 


exercise will surely test your MAPLE skills. Hint: just obtain half of the picture 
and reflect it to obtain the other half. 


Chapter 4 


CONSTANT MEAN 
CURVATURE SURFACES 


4,1 INTRODUCTION 


Much of science has to do with determining mathematical principles which 
underlie the phenomena of our universe. Although Kant seems to have over- 
stated the case in declaring that humans are a priori Euclidean geometers, 
there can be no denying the proliferation of physical theories based on geom- 
etry. From Newton, who gave only geometric arguments in the Principia, to 
Einstein, who formulated “gravitation” as the geometry of space-time, to the 
present-day creators of Grand Unified Field Theories, the underlying mecha- 
nisms of the universe are geometric in nature. 

Of course, in saying that the great forces of nature are consequences of some 
geometric structure, we lose sight of the fact that phenomena which we observe 
every day often display the same geometric dependence. Furthermore, we have 
already seen that nuclear cooling towers take the shape of hyperboloids of one 
sheet in order to prevent unnecessary stresses, so it should not be surprising 
that other types of construction might be similarly geometrically affected. 
Indeed, the Frei Otto school of architecture makes extensive use of minimal 
surfaces in its designs. 

In this chapter we shall give a brief introduction to minimal surfaces and 
surfaces of constant mean curvature. While these surfaces are intrinsically 
mathematically interesting, they also are objects of study and application in 
the sciences. In a later chapter we will consider this subject from a more 
mathematically advanced point of view (involving complex analysis). 

Surface tension is that quality responsible for the cohesiveness of liquids. 
Think of a liquid as a collection of (polar) molecules each of which exerts 
attractive forces of equal strength on the others. Deep inside the liquid a 
molecule feels equal forces from all directions, but near the surface of the 
liquid a molecule feels more of a force from inside the liquid than it does 
from the small number of molecules between it and the surface. Hence, those 
molecules near the surface are drawn into the liquid and the surface of the 
liquid displays a “curvature.” Thus, we see liquids form into drops or bubbles 
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because surface tension acts as a skin holding the liquid together. But a soap 
bubble doesn’t shrink away to nothing, so at some point surface tension must 
be balanced by an internal pressure. In the case of the soap bubble, of course, 
the atmospheric pressure inside the bubble is larger than that outside, so an 
eventual equilibrium is attained. Indeed, this war between surface tension 
and pressure is brought to a cease-fire by the mediating effect of geometry (in 
the form of curvature). 

Around 1800, Thomas Young (who is best known for his diffraction ex- 
periments establishing the wave nature of light) and Pierre Simon Laplace 
analyzed the phenomenon of surface tension in the following way. Consider 
the diagram below, which we take to be an “infinitesimal” piece of surface 
area expanded outward by an increased pressure. Note that, since the piece 
is very tiny, we may assume that the front and side curves are pieces of circles 
of radii R, and Roz respectively. 


r+ér 


FIGURE 4.1. Work done on a surface 


Let’s compute the work done to expand the area shown (which we denote by 
S). Remember that work is force x distance and pressure is force per unit 
area. We then get, 


W=F.-D 
=p-S-du 
=p-xry- du. 


But we can compute work in another way. Ask yourself, how could a soap 
film do work? Try this. Take a wire in the shape of a horseshoe and attach 
(loosely) a string to the two ends by loops. Hold the loops and dip the whole 
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object in a soap solution. Now let the loops go — what happens? The string 
is pulled to the top of the wire by the film! This means that the film has the 
potential to do work. Moreover, the quality responsible for contracting the 
string and the film itself is surface tension. Also, from this description we see 
that the amount of work done must be equal to the surface tension T (the 
units of which are force per unit length) multiplied by the change in surface 
area AS. Of course, 


AS = (x + 6x)(y+ dy) -azy 


from the picture (approximating the areas appropriately). Now, the length x 
is part of a circle of radius R; and x + 6z is part of a circle of radius R; + du 
with the same included angle. Therefore, we have the equality 


x+6é6a _ & 
R, + bu ~ Ry 


Treating y similarly gives 


Plug these into AS to obtain, 


du bu 
as=2(1+ 5) u(1+ 3) -ay 


=zry6 : pase +2 
ihe has ea 7 YR, RS 


If du is small, we may neglect the last term. Now put this expression for 
AS into the work formula W = TAS and equate it to our previous work 
calculation. We end up with, 


| 1 
p= Tf ( R + x) P 
This is the Laplace-Young equation. It says that a pressure difference on 
either side of a bubble or film is given by the product of the surface tension of 
the bubble or film and a quantity which is clearly related to the shape of the 
bubble or film. In fact, notice that the only requirement (implicitly) placed 
on our infinitesimal piece of area was that the curves meet at right angles (so 


S = zy). Also, notice that the quantities or and ie are the normal curvatures 
of the surface in those perpendicular directions. But, then (by Exercise 3.1.4) 
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we recognize the quantity 2H = 1/R;+1/R2 as twice the mean curvature of 
the surface. 

Now consider the case of a soap film bounded by a wire say. Since there is 
no enclosed volume, the pressure is the same on both sides of the film. Hence, 
p = 0 and (since T is constant) H = 0 as well. Therefore, we see that 


Theorem 1.1. Every soap film is a physical model of a minimal surface. 


It is clear that surface tension tries to make the surface as “taut” as possible. 
That is, the surface should have the least surface area among surfaces satisfy- 
ing certain constraints like having fixed boundaries or fixed volumes. In fact, 
this area-minimizing property implies the curvature property above and gives 
rise, in the case of films, to the term minimal. Let’s see how this works. 


4.2 First NOTIONS IN MINIMAL SURFACES 


To begin, let’s understand (in a somewhat informal way) how to integrate 
over a surface. Of course the simplest surface is the plane and here we do 
ordinary double integration. Our whole philosophy in dealing with surfaces 
has been to do all calculus computations in the plane and then transport them 
to a given surface via a patch x(u,v). Therefore, it’s no surprise that we do 
the same thing for this situation. In Chapter 1 we noted that the length of the 
cross product of two vectors equals the area of the parallelogram which they 
span. Now, what is the area of a tiny parallelogram-like piece of a surface? 
Since it is very small, we may think of the parallelogram as spanned by tangent 
vectors X, and x,. Hence, the area is given by |x, x x,|. This quantity then 
approximates the area of that tiny piece of the surface. Of course this is the 
usual modus operandi of calculus; approximate a small piece and then add up 
continuously (i.e. integrate) over the entire region. So, let x(u,v) be a patch 
for a surface M. Then we make the 


Definition 2.1. The area of the patch x, denoted Ax, is 


An= ff tux ml du dv, 


where the limits of integration are the defining limits of the patch. 


Furthermore, although we will not prove it, any compact oriented surface 
may be cut-up into a finite number of patches x (which, in fact may be taken 
to be oriented triangles) meeting only along their boundary curves with the 
opposite orientation. 
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FIGURE 4.2. Oriented triangulation of a patch 


Therefore, we may calculate the whole surface area of a compact oriented 
surface M as the sum of the areas of the individual patches 


A= f fbx du dv. 


Note also that if we miss a few points, this does not affect the integral. So, 
all of the patches we have considered up to this point allow surface area 
to calculated in a single integration. We shall return to this later and give 
some exercises, but right now our only aim is to use this idea to understand 
something about minimal surfaces. 

Finally, we will need Green’s theorem from calculus. Suppose P and Q 
are two real valued (smooth) functions of two variables x and y defined on a 
simply connected region of the plane. Then Green’s Theorem says that 


[ [ Ft fac [ pa ae 
y , OL Oy C 


where the righthand-side is the line integral around the boundary of the region 
C. Because we pull all integrals back to the plane for computation, we will 
find Green’s Theorem particularly useful. 

Now we may begin our analysis of minimal surfaces. Let z = f(x,y) be 
a function of two variables and take a Monge patch for its graph: x(u,v) = 
(u,v, f(u, v)). From Exercise 3.2.19 we have 


Xy, = (10) ta) Xuu = (0; 0; Jau) 
xX, = (0,1, f5) Ki 050; Jaw ) 
Xyy = (0,0, tae) 


- _ —fu,—fo, 1 
Xy X Xy = (—fu, —fv, 1) Se sy eseeses 
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E=1+f2 neon Pe G=1+f? 
f= fw — fuy — fuv 


~ /1+ f2+ f2 er J1+f24f2 " J1+f2+f2 


= Suufov =" Jas 
(C+ fi+ do)? 


ya (hit fe) fuu + (1+ fa) fov = 2fufofuv 
a(1+ f2 + f2)3 
The expression for H immediately gives the following 


Proposition 2.1. M is minimal if and only if 
fuu(l ae i) = 2hubofuv a fov(1 Se fa) = 0 


This partial differential equation is called the minimal surface equation. By 
placing algebraic or geometric constraints on the function f, we determine 
various types of minimal surfaces. 


EXERCISE 2.1. Suppose we require the algebraic condition that f(z,y) = g(x) + 


h(y). Show that H = 0 implies (1-+ h’”(y))g’ (x) + (1+9/7(x))h"(y) = 0. Separate 
variables and solve. Infer that 


f(2.4)= In (Se) 


cos ay 


The surface z = f(x,y) is called Scherk’s minimal surface. Note that Scherk’s 
surface is only defined for So a > 0. For example, a piece of Scherk’s surface is 
given over the square —5 < az < §, —§ < ay < §. In Chapter 7 we will discuss 
how different pieces of Scherk’s surface fit together. Surprisingly, only the catenoid 
and helicoid were known to be minimal in the 1700’s. Scherk’s surface was the next 
example of a minimal surface and it was discovered by Scherk in 1835. It is a wonder 


that the simple algebraic condition above was not considered earlier. 
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FIGURE 4.3. Scherk’s surface 


EXERCISE 2.2. Suppose we require the geometric condition that every level curve 
f(x,y) =c be a line. What types of minimal surfaces do we now obtain? Hints: 


1. In general, if a curve is given implicitly as f(x,y) = c, then the curvature is given 
by 


= feo hy + 2fefyfoy — fhe. 
vel 


Show this by invoking the implicit function theorem and writing f(z, g(x)) =c¢ 
with parametrization a(x) = (x, g9(x),c). Then, 


ge \g"'| 
ae 
(1+ 9'°)2 


Show g’ = -f by using Vf -a’ =0. Now find gq’ by implicit differentiation. 


3 

2. Show H = 0 fea + fyy = «| fe + fgl?- 

3. Now « = 0 (why?), so fer + fyy = 0. 

4, Show that a solution to Laplace’s equation above with the given geometric con- 
straint is 


f(z,yJ=A arctan( 2%) +B. 
r— £0 


If we let x — zo = ucosv, y — yo = usin, z — B = Av, then we see that the surface 
must be a plane or a helicoid. In fact, the proof that f(z, y) is the only such solution 
requires much harder analysis and is known as Hamel’s Theorem (see [Grau]). 
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EXERCISE 2.3. A unit speed curve on a surface a: J — M is said to be an 
asymptotic curve if a” .U = 0 for each point on the curve. By our formula 
a”. U = S(a')- a! = k(a’), we see that asymptotic curves are those curves which 
travel in directions of zero normal curvature. Show that a surface M is minimal if 
and only if through each point there are two orthogonal asymptotic curves. Verify 
that the parameter curves for the helicoid x(u,v) = (ucosv, usin v,v) are asymp- 
totic and orthogonal at each point. Hint: use K < 0, Euler’s formula for normal 
curvature and H = 0. 


We have seen in the last chapter that a nonplanar minimal surface of 
revolution must be a catenoid. Naturally, we can ask similar questions by 
imposing other geometric hypotheses. In Chapter 7 we shall consider the 
question of whether a minimal surface can also possess constant Gaussian 
curvature. Here, we confine ourselves to the following 


Theorem 2.2 (Catalan’s Theorem). Any ruled minimal surface in R° is 
part of a plane or a helicoid. 


Proof. (after [BC]) Let M: x(u,v) = G(u) + vé(u) where, without loss of 
generality, we make the following assumptions: (1) by Exercise 2.1.17, we 
may take @ perpendicular to the rulings of M (i.e. 6’-6 = 0) with unit speed 
(ie. 6-6’ =1). (2) 6 is a unit vector field of directions for the rulings 
along G (i.e. 6-6 = 1). Hence, we also have 6-6’ = 0. From these assumptions 
we derive the data, 


X, = 8+ v6’ Xy=6 X,XX,=f' x6+06' x6 
E=1+2vQ'-6' +076? F=0 G=1 
Xuu = 8" + v6" Xuv = 6 Xvy = 0 


— BY Bx 6+ vB" - 6 x 6+ v6": Bi x 6 +076". 5 x 6 
nn |: ae 


Then H = or = 0 since M is minimal, so = 0. Now, the numerator of 
is a polynomial in v, 


m=? n=O. 


£ 


BY. Bx b+v[B"-6 x6 +6": B' x 6] +076" - 5’ x 6, 


so for it to be zero, we must have each coefficient zero. Set the first term to 
zero, 3" - B’ x 6 = 0. This equation says that @” is contained in the plane 
spanned by @’ and 6, denoted (@’,6). Since @ is unit speed, @’ and @” are 
perpendicular. But by assumption, @’ is perpendicular to 6 as well. Since all 
three vectors are in the plane (@’,65), it must be true that @” is parallel to 6. 
In fact, since the length of 6 is 1 and the length of 3” is the curvature Kg of 
2, we must have B” = +Kg6. At any rate, we certainly have @” - 6’ x 6 =0. 
Now look at the coefficient of v set to zero and use @” - 6’ x 6 = 0 to infer 
6”. B' x 6 = 0. This says that 6” € (’,6) also. Now, the coefficient of v? 
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set to zero is 6”. 6’ x 6 = 0 and this implies that 6” € (6’,6) as well. Hence, 
6” € (6,6) M (8", 6). Now there are two possibilities. 

First, this intersection may be an equality (6’,6) = (@’,6). This follows if 
6” is not parallel to 6 at a point (so in some neighborhood as well). In this 
case, 3’ = aé’ since 3’, 6’ and 6 are in the same plane and both (’ and 6’ are 
perpendicular to 6. But then the unit normal of M would take the form 


- (beau) (BR 8). acy 
OST aaligicon Oe 


since |G’ x 6] = 1. But then, taking the derivative of U with respect to u, we 
have U' = 8” x 6+ 8’ x & = 0 since B” = kgé and @’ = a6’. Hence, U is 
constant, so M is part of a plane. 

Secondly, if (6’,5) # (@’,6), then the intersection is the line (6). Hence, 
6” = aé and, consequently, 3’- 6” = 0. But 6’- 6 = 0 implies kg = Kg6-6 = 
3" -6 = —' - 6’ and this, in turn, gives 


dep _ _ gw. 5 _ gg =0 

du 
since 3” and 6” are parallel to 6 and (’ and 6’ are perpendicular to 6. Hence, 
the curvature Kg is a constant. Now consider the torsion tg of 3. The usual 
formula becomes Tg = (3 x 6- 6’ since 3” = kg6 and, because k is constant, 
Bl" ==: Kad’ : 


EXERCISE 2.4. Show that 92 = 0. 


Therefore, Tg is a constant. Recall that a curve with constant curvature and 
torsion is a circular helix. Up to arigid motion of R°, we may then parametrize 
B by | 

3(u) = (Acosu, Asinu, Bu) 


with A? + B? = 1. Also, 6 is parallel to 6", so 6(u) = (cosu,sinu,0) since 
6 has unit speed. Let A+ v=. We then obtain a parametrization for the 
helicoid, x(u, v) = (dcosu, dsinu, Bu). O 


So, we see that the two most natural geometric conditions to impose along 
with minimality, being a surface of revolution or being ruled, lead to the 
minimal surfaces of the 18" century, the catenoid and the helicoid. 


EXERCISE 2.5. Plot the following surfaces. Show that Helcat, Enneper, Scherk 
and Planar Lines of Curvature are minimal and compute their Gaussian curvatures. 
You may find MAPLE to be of use here, but be careful. If you try Henneberg or 
Catalan, save your worksheet before actually computing! 
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1. Heleat. x(u,v) = (x1(u,v), 27(u, v),2°(u,v)), for any fixed t, where 
z! (u,v) = cos(t) sinh(v) sin(w) + sin(t) cosh(v) cos(u) 
x*(u, v) = —cos(t) sinh(v) cos(u) + sin(¢) cosh(v) sin(w) 
(u,v) = ucos(t) + vsin(t). 
2. Henneberg’s Surface. x(u,v) = (x'(u,v), x7(u, v), 23(u,v)), where 
a (u,v) = 2sinh(u) cos(v) — : sinh(3u) cos(3v) 
z(u,v) = 2sinh(w) sin(v) — = sinh(3u) sin(3v) 
(u,v) = 2 cosh(2u) cos(2v). 
3. Catalan’s Surface. 
x(u,v) = (u — sin(u) cosh(v), 1 — cos(u) cosh(v), 4sin(u/2) sinh(v/2)). 
4. Enneper’s Surface. 
x(u,v) = (u—u?/3 + uv’, v — vy /3+uu7,u? — v’), 
5. Scherk’s Fifth Surface. This surface is often written in nonparametric form sin z = 
sinhz sinhy. Parametrically, 
x(u, v) = (arcsinh(u), arcsinh(v), arcsin(uv)). 
6. Planar lines of curvature surface. 
cuxtsinucoshuy uv+ccosusinhv 


x(u, v) = —_ Vea ; + cos wcosh v) % 


QUE 


2 


ES 


FIGURE 4.4. A one-parameter family of minimal surfaces joining 
the helicoid and the catenoid 
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4.2 First Notions in Minimal Space 


Henneberg’s surface 


FIGURE 4.5. 


Catalan’s surface 


FIGURE 4.6. 


Enneper’s surface 


FIGURE 4.7. 
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FIGURE 4.9. Planar lines of curvature surface 


4.3 AREA MINIMIZATION 


Now let’s try to understand this word “minimal” a little better. In the mid- 
1800’s the Belgian physicist Plateau set the following problem: given a curve 
C, find a minimal surface M having C as boundary. Plateau was interested 
in thin films (e.g. soap films) and his problem was a natural outgrowth of his 
physical experiments. As we shall see below, least-area surfaces are minimal. 
In fact, it was eventually realized that hopes of finding a general method of 
solution for Plateau’s problem rested on producing least-area surfaces. Thus, 
another version of Plateau’s problem is to find a least-area surface having 
C as boundary. But even the existence of area minimizing surfaces is not 
automatic. In fact, it was only in the 1920’s and 1930’s that Plateau’s problem 
was first solved by J. Douglas and T. Rado (see [Dou] and [R]). They proved 
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Theorem 3.1. There exists a least-area disk-like minimal surface spanning 
any given Jordan curve. 


(A minimal surface is disk-like if its parameter domain is the unit disk D = 
{(u,v)|u? +v? < 1} and the boundary circle maps to the given Jordan curve.) 
To show the delicateness of this problem, consider 


EXERCISE 3.1. Let C be the unit circle in the zy plane. Show that the following 
variant of Plateau’s problem has no solution: Find the surface of least area which 
has C as boundary and passes through (0,0,1). Note that this problem begs the 
question: exactly what is a surface? Does a disk with a spike qualify (see Figure 
4.10)? This (along with an appropriate definition of area) was one of the main 
difficulties in considering Plateau’s problem. Of course from our very “differentiable” 
point of view, we obviate such considerations. 


FIGURE 4.10. Disk with spike 


Let’s look at a simpler question. What is a necessary condition that M 
have least area among surfaces with boundary C? The answer may be found 
through a simplified version of the calculus of variations as follows. 

Suppose M: z = f(z,y) is a surface of least area with boundary C. Con- 
sider the nearby surfaces which look like slightly deformed versions of M, 


M*: z'(a,y) = f(x,y) + tg(z,y). 


Here, g is a function on the domain of f which has the effect, when multiplied 
by a small t and added to f, of moving points of M a small bit and leaving 
C fixed. That is, g|—4 = 0, where C’ is the boundary of the domain of f and 
f(C) =C. A Monge patch for M® is given by 

x'(u,v) = (u,v, f(u,v) + tg(u, v)). 


Immediately we compute 


jx*, x x? | = J1+ f2 + f2 + 2t( fugu + foge) + t7(gu? + gv?). 
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By the definition of area, we see that the area of M* is 
= | [ VIF RFR Rae + Fog) + PGES G7) dude, 


Now take the derivative with respect to t (which then passes inside the inte- 
gral), 


2 2 
At) = ff a ae ee) tt 
L+ f2+ f2 + 2t(fugu + fogv) + t2(gu2 + gv?) 


We assumed z = Z) was a minimum, so A’(0) = 0. Therefore, setting t = 0 
in the equation above, we get 


| [PS aw-=0 
vJu f14+ f2t+ f? 


Now, let 
fug fug 


PS Q = ——_—_.. 
VI+ +5 Vit fit 
EXERCISE 3.2. Compute 9P F and ge and apply Green’s Theorem. 


We then get 


[[ 5 Fugut fou y ito 
v Ji+hi+f 
yf [ Mattie fall = abba ay 
(1+ f2+ f2)2 
fug dv fug du 


since g|@ = 0. Of course the first integral is zero as well, so we end up with 
[ [Meee eae 2fudvfuel a dy = 0. 
(ie jets2) 
Since this is true for all such g, we must have 
fuu(1+ fo) + fool + fa) — 2fufofuv = 0. 


But this is the minimal surface equation. Therefore, we have shown the 
following necessary condition for a surface to be area minimizing. 


Theorem 3.2. If M is area minimizing, then M is minimal. 
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EXERCISE 3.3. Let M be a graph z = f(z,y) above a domain D C R? with 
boundary a closed curve C. Show that, if f satisfies the minimal surface equa- 
tion, then M has least area among all graphs of functions z = g(x,y) with glo = 
f\lc. Hints: (1) a surface Z is formed by adjoining the two graphs along g(C) = 
f(C), (2) take the normal vector field for M defined in all of D x R by Uy = 

Tinea?’ a sae}: (3) show div(Uy) = 0, apply the divergence 
theorem and estimate each surface integral obtained by using the fact that Uy is a 
normal vector field for one of the pieces of Z, but not the other! 


EXERCISE 3.4. Determine the parameter a for the catenary y = acosh(2) which 
passes through the points (—.6,1) and (.6,1). Compute the surface area of the 
catenoid generated by revolving the catenary about the z-axis. Show that this 
surface area is greater than the surface area of two disks of radius 1. Hence, if 
the original boundary curve consisted of two circles perpendicular to the x-axis 
centered at (—.6,1) and (.6,1), then the catenoid would be a minimal, but non- 
area-minimizing, surface spanning the boundary. Hint: MAPLE (e.g. “fsolve” and 
numeric integration). 

In fact, more can be shown. Let 29, —xg be the points on the z-axis which are 
centers of the given circles of radius yo. . 

1. If a > .528 (approximately), then the two disks give an absolute minimum for 

surface area. This is the so-called Goldschmidt discontinuous solution. 
2.11 < .528 (approximately), then a catenoid is the absolute minimum and the 
Goldschmidt solution is a local minimum. 

3. If 528 < a < .663 (approximately), then the catenoid is only a local minimum. 
4, If a > .663 (approximately), then there is no catenoid joining the points. This 
can be seen using soap films by forming a catenoid between two rings and slowly 
pulling the rings apart. Measure re and you will see that the catenoid sponta- 
neously jumps to the two disk solution at approximately a = .663. For an informal 
discussion of this problem, see [Isen]. For a formal discussion, see [Bli]. 


4.4 CONSTANT MEAN CURVATURE 


Now let’s look at situations where mean curvature is non-zero, but constant. 
Before we begin, we present a very beautiful formula which will be the founda- 
tion for our study. Let M be a compact oriented immersed surface in R® with 
unit normal U = eel , mean curvature H and area A = f f |x, xx,|dudv. 
Recall that when we say that M is immersed, we mean that we allow self in- 
tersections, but not locally. That is, around each point in M there is a small 
open neighborhood for which there are'no self intersections. In terms of a 
patch x(u,v) for M, an immersion requires x,, and x, to be linearly indepen- 
dent at every point. Thus, a nonzero normal x,, x x, is well defined at every 
point of the surface. Enneper’s surface is an example of an immersed surface. 

To continue, just as in our derivation of the minimal surface equation, we 
perturb M a bit by a vector field V on M. Note that we use a vector field 
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here instead of a function g because M is not defined as the graph over some 


domain. 
M*: y‘(u,v) = x(u,v) + tV(u, v) 


and write the area of M* as 


A) =f five x yol dudy 


= ei Ixy X Xy|” + 2t(xy x Xy)(Xy x Vy + Vy X Xy) + O(t2) dud. 


Then, taking the derivative with respect to t and evaluating at t = 0, we 
obtain 


A’'(0) = [f{ eesmrw +V, x x,y) dudv 


x x X 


=f{ {Se xn- Xx, xX V,) dudv 


x XX 
= f [MoU xx -W-U xx, du dv. 


EXERCISE 4.1. Let P= —-V-U x xy andQ=V-U x xu, apply Green’s Theorem 
and the shape operator formulas of Chapter 3.1 to get 


Jf Vo Ux xu = Va x xy + V 2H x x0) ded 


Cc 


When we integrate over the entire collection of patches on M, the line integrals 
around the boundaries all cancel due to the orientation of M. Hence, over M, 
the lefthandside above is zero and we have 


= f [Vo x20 = Ve U x xy dudy 


-f [ ven, xX Xy) dudv 


- f f2xu -V dA _ since, by definition, dA = |x, x x,| dudv . 


Let’s do two quick applications of this formula. First, let V = x(u,v) so that 
y’ = (14+ t)x and A(t) = (1+12)?A since M is simply stretched uniformly. 
Clearly, A’(0) = 2A so we have 


A=~| [HU-xaA, 
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Thus, we get a formula connecting surface area to mean curvature. We will 
exploit this below. Secondly, let V = fU, a normal vector field. Of course, 
this is the type of force field we would expect to arise from pressure exerted 
uniformly on a surface. Now, U-U = 1, so we have A’(0) = —2f [ fH dA. 
Also, the geometric interpretation of the divergence of a vector field as a 
rate of change of volume V (see [MT, p. 227]) together with the divergence 


theorem itself gives 
v’(0) =f [ fawse dx dy dz 


=| [u-vaa 
=| fraa, 


(Recall that the divergence (or Gauss’s) theorem says that, for a surface 
M enclosing a volume Q, the volume integral of the divergence divV = 
OV! /dx! + OV? /dax? + OV3/Aax? of a vector field V = (V',V?, V3) is equal 
to the surface integral of the component of V normal to M. Formally, 


[f{ fawvan= ff v-vaa 


where U is the unit outward normal of M.) So, the problem of minimizing 
surface area (i.e. A’(0) = 0) subject to having a fixed volume (i.e. V’(0) = 0) 
is equivalent to finding H satisfying 


| [tH aa=o for all f with [ [taaqo. 


EXERCISE 4.2. Show that this condition implies H = c, a constant. Hints: (1) 
Write H = c+ (H —c) = c+J wherec = oe (2) Show ff J? dA = 
f f J(H —c)dA =0. (Use the hypothesis.) (3) Then J = 0. (4) Hence, H =c. 


Of course, the surface tension of a bubble works to achieve exactly the situa- 
tion above. Therefore, we have shown that 


Theorem 4.1. A soap bubble must always take the form of a surface of 
constant mean curvature. 


But what are these surfaces then? We have seen in Chapter 3 that, in the 
1800’s, Delaunay classified all surfaces of revolution with constant mean curva- 
ture. Of these, however, only the sphere is compact (i.e. closed and bounded) 
and embedded in R*. The question arose whether there are any other com- 
pact embedded surfaces of constant mean curvature? In other words, must 
non-self-intersecting soap bubbles always be spheres? The answer, originally 
due to Alexandrov, is yes. To prove Alexandrov’s result we make use of our 
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formula above together with a beautiful estimate due to Antonio Ros. In 
these results notice the essential connections between geometry (in the forms 
of area, volume and curvature) and vector calculus (divergence theorem, cur- 
vature). Of course, curvature is the link which binds these subjects together. 
We begin by giving Ros’s estimate, then prove Alexandrov’s theorem and, 
finally, prove Ros’s theorem. Ros’ theorem is pretty heavy going, but an un- 
derstanding of the ideas involved definitely has its geometric rewards. (The 
following is adapted from [Oss2].) 


Theorem 4.2 [Ros]. Let M be a compact embedded surface in R*? bounding 
a domain D of volume V. If H > 0 on M, then 


1 
— dA > 3V. 
hi 


Furthermore, equality holds if and only if M is the standard sphere. 
Theorem 4.3 (Alexandrov). If M is a compact embedded surface of con- 
stant mean curvature, then M is a standard sphere. 


Proof. Suppose H is constant and use the area formula above and the diver- 
gence theorem to get (where U is the interior normal) 


A=-f[ [ Hu-xaA 
aH] [ [divx dedyae 


=3HV. 


But then we have 


1 1 AA 
| fa=a] [a-g-a-¥ 


and by Ros’ Theorem, M must be a sphere. O 


Proof of Ros’s Theorem. First, let’s fatten M up a bit by taking a shell S on 
M’s inside given by 
S={p+eh(p)U |pe M} 


where U is the unit normal of M, 0 < « < 1 and A(p) is defined by 
h(p) = sup{r | the point p is the unique nearest point on M to the point q 
at a distance r from p along U}. 


This definition means the following. For any p € M, go in the direction of the 
inner normal U and consider each of the points g along the way. For some of 
these points g on the line p+ tU, p is the nearest point of M to q. Now just 
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focus on these points and take the distance r of the farthest such point. (In 
fact, there is a small technical point here. It is possible that a point q which 
is farthest does not have a unique nearest point on M. Think of the center of 
the sphere! This is why we take the supremum of the distances of the q’s for 
which p is the unique nearest point.) 

We consider this shell S because we use the volume V of D in our estimate 


and we will see that 
Vol(D) = Vol(S). 


Moreover, the volume of S may be calculated in terms of curvatures in a 
convenient way. In order to see the volume equality, first note that there can 
be no overlap on the interior of the shell. This follows from the definition 
of h(p) since every point of the shell lies on a unique normal line to M. Of 
course, there may be overlap on the inside boundary of the shell (as the sphere 
example shows), but this will have no effect on volume since the boundary is 
only 2-dimensional. 

Secondly, let g € D be arbitrary and let d be the distance from q to M 
(i.e. to M’s nearest point). Of course such a distance is obtained along a 
line perpendicular to M. At any rate, if B(q,d) denotes the closed ball of 
radius d about q, then the interior of B(q,d) contains no points of M and the 
boundary of B(q,d), denoted 0B(q, d), contains at least one p € M. Consider 
the radius of B(q,d) from q to such a p and take any q’ on this radius. Let r 
be the distance along the radius from q’ to p and note that B(q’,r) lies in the 
interior of B(q,d) except at p. Therefore, p is the unique point of M realizing 
distance r from q’ to M and all of these q’’s are in the defining set for h(p). 
Now, the supremum of the distances of the q’ is d since the q’ get arbitrarily 
close to q. Of course, there may be other points in the set further out, but 
nevertheless, we see that d < h(p). This means, by definition of the shell S, 
that q € S. Thus, every point of D is in the shell S and the only overlap of 
S occurs on the 2-dimensional boundary. Therefore, the volumes of D and S$ 
must be the same. 

Now, if M has a local parametrization x(u,v), then we may parametrize 
the shell S by 

y(u, v,t) = x(u,v) +tU(u, v). 


Note that we need three parameters u, v and t since the shell is 3-dimensional. 
Here, t varies as 0 < t < h(p) = h(u,v) where p = x(u,v). Now, a small 
chunk of S may be thought of as a parallelepiped determined by y,, yy and 
yz. Consequently, the small chunk has volume |y, x yy - yz|. The volume of 
S is then obtained by integration 


h(u,v) 
Vol(S) = gat lvu X Yo ° Y2| dt dudv. 
) 


To determine the integrand, for the moment assume that the parameter 
curves x(u, U9) and x(uo, v) are lines of curvature. That is, ki x, = S(xu) = 
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—-Vx,U = —U,, and kox,y = S(x,) = —Vx,U = —U,y. Then, 
Yu =X, +t, =(1—kit)x, and yy =x, +tU, = (1—ket)x,. 


This means y, xX yy = (1 — kit)(1 — kot) x, x Xy = (1-—2Ht + Kt?) x, x x,y 
since k} + kg = 2H and kiko = K. But H and K don’t depend on the 
parametrization chosen. So, even though we assumed x,, and x, were principal 
vectors to obtain the formula, in fact the formula must hold in general! Of 
course, y; = U, so this means |y,, x yy - yz| = |1 — 2Ht + Kt?||x, x x,| and 


h(u,v) 
Vol(S) = eal lvu X Yu yr|dt dudu 
0 
h(u,v) 
=f f | |1 - 2Ht + Kt?|dt > |x, x x,|dudu 
0 
h(u,v) 
=| [ pais Keath dA 
M 0 


h(u,v) 
=/ | \(1 — yt) (1 — ket)| dt dA. 
M JO 


From this expression for the volume of the shell we see that the principal 
curvatures come into the scheme of things. In fact, they are related to the 
function h(p) by the following estimate: 


1 
—~ > k 
h(p) = max{ki(p), 2(p)} 
for all p € M. Let’s see why this is so. Let qg be the point a distance h(p) 
along the normal from p. Then the open ball B(q,h(p)) cannot contain any 
points of M, for if there were a p’ € MN B(q,h(p)), its distance to g would be 
less than h(p) and for all q’ close to q on the radius from q to p the distance 


from q’ to p’ would be less than the distance from q’ to p. This contradicts 
the definition of h(p). 


EXERCISE 4.3. Prove this last statement. Hints:(1) h(p) is a supremum, so there 
is a sequence q’ — q with p the nearest point of M to q’ along a normal, (2) if a p’ 
exists, then the distance of gq’ to M is less than the distance of q’ to p, (3) since this 
is true for all q’ close to q, the sequence q’ — q can’t exist. 
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Hence, B(q, h(p)) contains no points of M and, therefore, the sphere S(q, h(p)) 
which bounds B(q, h(p)) lies completely inside M only touching at p. But this 
means that this sphere of radius h(p) has larger normal curvature at p in every 
direction than M. (Compare the proof of Theorem 3.5.2 for the opposite 
estimate.) Of course, the normal curvatures on a sphere are constant and 
equal to the reciprocal of the radius. In our case then, the normal curvature 
is always nO} and we have the required inequality. 


Now, the inequality above says that k; h(p) < 1 and kp h(p) < 1. Hence, 
(1 — kyt) and (1 — ket) are nonnegative for 0 < t < A(p) and 


h(p) h(p) 
[aa = katate [O(a = fata ta 
0 0 


The normal curvature inequality and the geometric-arithmetic mean inequal- 
ity ab < (242)? imply 


on ee ee me 
O~ Fey? =O poet Ato)” 
< (1 — ky t)(1 — ket) 
< (1 — Ht)’. 


Hence, ;+, > H(p) or HG) > h(p). This gives 


h(p) + 1 
0 3H 


and consequently, for V = Vol(S), 


h(p) 1 
v= | Fy (1 — ht)(1 — hat) dtaa < f aA 
M 0 M 3H 


as was desired. Finally, note that for equality to hold, it must hold in 
(1 — kyt)(1 — ket) < (1 - Ht)?. 


EXERCISE 4.4. Show that (1 — kit)(1 — ket) = (1 — Ht)? if and only if ky = ke 
every where. 


Hence, equality holds in the estimate if and only if M is all umbilic. By 
Theorem 3.5.1, this is true if and only if M is a sphere. O 
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Results such as Alexandrov’s Theorem led H. Hopf to conjecture that all 
immersed surfaces of constant mean curvature are spheres. It has only been 
in recent years that this conjecture has been shown to be false. H. Wente 
[We] was the first to construct tori of constant mean curvature which are 
immersed in R°. Since Wente’s work, many such have been constructed with 
important consequences for mechanics (for example). We have by no means 
given an exhaustive study of mean curvature. There are many interesting and 
important questions which remain and we hope that the brief outline above 
will lead the reader to further study. 


4.5 HARMONIC FUNCTIONS 


Before we leave this chapter, we want to indicate some of the connections 
between minimal surfaces and other areas of Mathematics. We will examine 
these connections more closely in a later chapter, but it is worthwhile to 
understand, from the outset, that this subject touches, and is touched by, 
many areas of Mathematics and Science. 

A very important partial differential equation studied in mathematical 
physics is the Laplace equation or potential equation. In two-dimensional 
Cartesian coordinates, the Laplace equation is 


(An analogous version of this equation is found in higher dimensions as well 
as in other coordinate systems.) This equation describes, among other things, 
steady-state temperature distribution, equilibrium displacements of a mem- 
brane and gravitational and electrostatic potentials. 


Definition 5.1. A real-valued function ®(z, y) is said to be harmonic if all 
of its second-order partial derivatives are continuous and each point in its 
domain satisfies the Laplace equation. 


Although in a later chapter we shall be precise in specifying the particulars of 
the relationship between harmonic and complex analytic functions, we men- 
tion here 


Theorem 5.1. If f(z) = u(x,y) + iv(z,y) is complex analytic (i.e. the 
derivative f'(z) exists at all points of a region of the complex plane), then 
each of u(x, y) and v(x, y) is harmonic. 


Example 5.2. Let f(z) = z*, where z =x +iy. Then 
z* = (2 + iy)? 
= x? + Qizy —y? 
= (x* — y*) + i(2zy). 
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So, in this case, Re(z”) = u(x, y) = x? — y? and Im(z”) = v(z,y) = 2zy are 
the particular harmonic functions (and consequently, harmonic conjugates) 
produced from f(z) = z?. Thus, since the partial derivatives for u are uz = 
22, Uy = —2y, Ure = 2 and uy, = —2, then u satisfies the Laplace equation, 
Une + Uyy = 2+(-2) = 0. Similarly, the partial derivatives for v are vz = 
2y, vy = 22, Vez = 0 and vy, = 0, so v satisfies the Laplace equation, 
Vex + Vyy =0+0=0 as well. 


EXERCISE 5.1. Let f(z) = z°?, where z = «+ iy. Calculate the real part of f. 
The harmonic function obtained produces the surface known as the monkey saddle. 
Calculate the monkey saddle’s curvature functions. Graph the monkey saddle. 


It is often useful to visualize what harmonic functions look like in R® in 
order to discuss some of their properties. The graphs of the two harmonic 
functions in the previous example are the standard “saddle” surfaces. It is 
not always a simple task, though, to recognize the graphs of more complicated 
harmonic functions. This is where our soap film can be used, since an infor- 
mal physical interpretation can be simply made using soap film. However, it 
cannot be said in general that graphs of harmonic functions are minimal sur- 
faces or vice versa. Consider the harmonic function u(z, y) = 2? —y? from the 
previous example. Calculating H for the corresponding Monge patch yields 


W= 2(1+4y?)—2(1+427) dy? — 42? 
~  Q(14+- 402 + 4y2) 1:4 dar? + Ay?” 


Clearly, the mean curvature for such a surface is not identically zero. Similarly, 
in general, minimal surfaces are not graphs of harmonic functions. Unfortu- 
nately, calculating comparative information from a minimal surface can be 
very difficult. Very little is known about the parametrizations for an arbi- 
trary soap film. Finding explicit parametrizations for minimum area surfaces, 
even when given relatively simple boundaries, has proved to be a nearly im- 
possible task since Plateau’s day. 

A natural question to then ask is, how closely do minimal surfaces approxi- 
mate harmonic functions? First, consider the plane. The function z = ®(z, y) 
would have the form az + by + c, which is clearly harmonic. (The variable 
z is not a complex variable here, but is the vertical coordinate in Cartesian 
3-space.) Now, what if the surface were nearly planar? Using an argument 
similar to the one we used above to show area-minimizing implies minimal, it 
can be shown that in this case the function ® which minimizes area across a 
Joop is harmonic as well. 


EXERCISE 5.2. Suppose z = f(z,y) is a function whose graph spans a given 
curve and has least area among surfaces spanning the curve. Further, suppose all 
higher powers > 4 of partials of f are negligible. That is, fi, fi and f2 fa are SO 


small that the integrand ,/1+ f2+ te of the area integral is well approximated by 
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4/(1+ al fe fra 14+ a[f2 a 2). Then, up to a constant provided by the term 


1 and the factor 7 the relevant area integral is 
A= | 2+ fpdedy. 


Now vary z = f(x,y) to get a nearby surface z*(x,y) = f(x,y) + tg(x,y) and carry 
through the derivation of Theorem 3.2 as before using the area integral above to 
show that f is harmonic. See [CH]. 


Unfortunately in most cases, as the minimal surface becomes less planar, 
it begins to vary from the graph of a true harmonic function. The argument 
used above breaks down when fourth-order terms are no longer negligible. The 
soap film is less and less a good approximation of the graph of a harmonic 
function. 


Another interesting relationship between minimal surfaces and harmonic 
functions comes about when the surface is parametrized by isothermal coordi- 
nates. A parametrization x(u, v) is called isothermal if E = G and F = 0. We 
will show later that every minimal surface has a parametrization with isother- 
mal coordinates. When isothermal parameters are used, there is a close tie 
between the Laplace operator Ax = x,y, + Xy, and mean curvature. Recall 
that we have the following formulas for an orthogonal coordinate system: 


E. E 
Xuu = yo 3 ema +1U, 
Xuy = oma + 9G +mU, 
Gy 
Xyy Gu X, + — x, + nU. 


~ OB 2G 


Theorem 5.2. If the patch x is isothermal, then Ax = Xuu t+ Xyy = 
(QE H)U. 


Proof. 
E,, E, Gy Gy 
een = (Sx ae +10) + (St, + Se, +n) 
E E E. E. 
sto 2 a! yy 4nU ince E=G 
aha apr tlU apt + op* +n since 
=(l+n)U 
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By examining the formula for mean curvature when EF = G and F = 0, we 


see that 
- El+En _ l+n 


H 
2 2E 


Therefore, 
Ket hi CEA). 


O 


Corollary 5.3. A surface M: x(u,v) = (z}(u,v),2?(u,v),23(u,v)), with 
isothermal coordinates is minimal if and only if x',x? and x° are harmonic 


functions. 


Proof. If M is minimal, then H = 0 and, by the previous theorem, Xyy+Xvy = 
0. Therefore, x is harmonic. On the other hand, suppose z', x”, and 2° are 


harmonic functions. Then x is harmonic, so Xy,+Xyy, = 0 and, by the 
previous theorem, (2 H)U = 0. Therefore, since U is the unit normal and 
E=x,:xX, #0, then H = 0 and M is minimal. O 


This result will have a major impact on our later discussion of minimal 
surfaces. It is, in fact, the link between the geometry of minimal surface theory 
and the powerful methods of complex analysis. For more on minimal surfaces 
and their relation to the sciences, see Chapter 7 as well as the wonderful 
books [Ni], [DHKW}, [Oss1], [Bo], [DWT], [HT] and [Isen]. In particular, 
[Ni, Introduction] and [HT] provide histories and overviews of the calculus 
of variations, minimal surfaces and the uses of minimal surfaces in science, 
engineering and architecture. 


Chapter 5 


GEODESICS, 
METRICS, 
AND ISOMETRIES 


5.1 INTRODUCTION 


In axiomatic geometry the axioms focus on the characteristics of the funda- 
mental objects of geometry, points and lines. Moreover, in order to test these 
axioms, certain models of geometry are constructed. For example, Riemann’s 
non-Kuclidean geometry is modelled by the sphere with lines being the great 
circles (i.e. circles on the sphere having the center of the sphere as their own 
center). With these definitions, the sphere models a geometry where there do 
not exist any parallels to a given line through a given point. Of course the 
ramifications of such a result are tremendous. In particular, the angle sum of 
a triangle is greater than 180°. 

If differential geometry is to somehow connect with traditional axiomatic 
geometry, then it must produce its own abstract definition of line (as opposed 
to the Euclidean axiomatic nondefinition of it) and show that this works for 
the typical models of Euclidean and non-Euclidean geometry. In particular, 
if we have the correct notion of line, then we should be able to prove (rather 
than assume or define) that the lines of the sphere are great circles! 

So what should the differential geometric notion of line be? Previously we 
showed that a straight line in the plane gives the shortest distance between 
two points. We could attempt to use this “distance-minimization” criterion 
as our definition, but it is a hard condition to check. Instead, let’s pick out 
another property of a straight line — a property which is readily calculable 
and which also characterizes the line — the vanishing of its second derivative. 

Now, we cannot take a curve a on a surface M in R®? and require a” = 0, 
since this would just give us a straight line in R® and, for an arbitrary M 
(such as the sphere), there is no reason why such a line should remain on 
M. But, let’s think of all this from the point of view of a resident of M. 
That is, let’s take the viewpoint of a creature who lives on M and has no 
perception of the 3° dimension given by the unit normal U. Because the 
creature cannot see anything outside of the tangent plane T,M, it cannot see 
the normal component of acceleration. Suppose a has unit speed. Then we 
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have two perpendicular unit vectors T = a’ and U (the unit normal of M). 
A third such vector may be obtained by taking T x U. Now, three vectors 
which are mutually perpendicular form a basis of R?, so any vector is a linear 
combination of them. Hence, we may write a” = AT+ B(T x U)+CU and 
calculate the coefficients as 


a’ .T=A, a’ -TxU=B, a VU =, 


where we use the fact that JT, T x U and U are unit vectors. Hence, we may 


write 
a” = (a"-T)T + (a"-TxU)(T x U)+(a"-U)U. 


Further, a’: a’ = 1 since a has unit speed, so the product rule for differentia- 
tion gives a” - a’ +a’- a" = 2a’-a"” = 0. Hence, a’: a” = T-a” = 0 and we 
have no T-component for a”, 


a” =(a"-TxU)(TxU)4+(a"-U)U. 


The usual identities involving dot product and cross product give U:T xU = 0 
and a”-T xU =U-a" xa',soT x U is in T,M for all p € M and 


a” -TxU=U-a" xa’ 
= |U| |a” x a’|cosé 
= |a” x a’|cosé 
= Kg, cos 9 


where Kg is the curvature of a and @ is the angle between a” x a’ and U. 
This quantity is often called the geodesic curvature of a and is denoted 


Kg = Ka cosé. 


EXERCISE 1.1. Show that the following relation holds between the curvature Kq of 
a, the normal curvature k(a’) of a’ and the geodesic curvature Kg of a: 


Ke = k(o!)? + a 


Hint: how is the normal curvature of a’ related to the curvature of a? 


EXERCISE 1.2. For the topmost parallel (i.e. u = 4) on the torus, compute K, Kg 
and k and show that the relation above holds in this case. 


So now we see that we are able to decompose acceleration into tangential 
and normal components 
Qlan =KgT XU and OQ anner = (aU) U, 


In keeping with our viewpoint as a resident of M, we see no acceleration 
exactly when at/,,, = 0. Therefore we make the following definition. A curve 
ain M is a geodesic if at’, = 0. 
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EXERCISE 1.3. For a non-unit speed curve a(t) with speed v, show that 


a" = ST + ng T XU t(a"-U)U 


Hints: (1) Recall T = x and differentiate a’-a’ = v? to get a” -T = S- (2) Use the 


, 
formula for the curvature of a non-unit speed curve to show that a”. 2 xU = kg vy, 


The geodesics of a surface will be the lines of our geometry. We first note 
a very simple property of a geodesic (which also follows from the exercise 
above). 


Lemma 1.1. A geodesic has constant speed. 


Proof. The speed of a is vy = |a’|, so v? = a’ - a’. Differentiation yields 
2st = a"-a! +a!- a! = 2a'-a" =0 since a” = ag and a! na 0! = 0. 
Hence, de = 0, so v is constant. O 


Note also that a straight line a(t) = p+ tg lying in M must be a geodesic 
because a” = 0. Of course, as we expect, this condition characterizes geodesics 
in our familiar Euclidean geometry. To see this from our definition, suppose 
P is a plane with normal U and a is a geodesic in P. By definition, at/,,, = 0, 
so a” = (a”-U)U. But a’-U = 0 since a lies in P and the product rule 
gives, 

0=(a'-U) =a"-U+a'-U' =a" -U 


since U is constant. Hence, since both the tangential and normal components 
of a” vanish, we have a” = 0. Therefore, a must be a straight line. 


Another class of examples is provided by surfaces of revolution. Let a(u) = 
(g(u), h(u), 0) be a curve with unit speed parametrization. Recall that the as- 
sociated surface of revolution (about the x-axis) has parametrization x(u, v) = 
(g(u), h(u)cosv, h(u)sinv) with 


B= xXy xy =), F=0, G=X,-X, =h’ >0. 
Let’s differentiate E and F' with respect to u. Recall that this means we take 


the directional derivative of these functions in the x, direction. Since EF = 1 
and F' = 0, we have 


0=x,[E] 0= x,[F] 
KiKi al = Ki [Xy2 Xy| 
= Xuu' Xu + Xu Xuu = Xyu + Xy + Xy° Xuv 


= 2Xu° Xuu = Xuu ° Xv 
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since Xy ‘Xu, = (g’,h’cosv,h’sinv) - (0,—h’sinv,h'cosv) = 0. Hence, 
Xuu *Xu = 0 and Xyy +X, = 0, sO Xyy, is perpendicular to T,(M). That 
is, (Xuu)tan = 0. Hence, meridians of surfaces of revolution are geodesics. We 
shall generalize this statement shortly when we consider Clairaut parametriza- 
tions. 


Example 1.1: Geodesics on S2. Let S? be an R-sphere with parametri- 
zation 


x(u,v) = (Rcosucosv, Rsinucosv, Rsinv). 


Here we see that the parametrization gives a surface of revolution with g(v) = 
Rsinv and h(v) = Reosv, so the v-parameter curves are geodesics. That is, 
longitudes from the North pole to the South pole are geodesics. Since the 
sphere is symmetric about its center, by an appropriate rotation we may 
consider it as a surface of revolution about any line which passes through 
the center. The v-parameter curves are then circles on the sphere which 
connect the points of intersection of the line with the sphere. Since these 
points correspond to the North and South poles under the rotation, the circles 
connecting them are great circles on $2. Recall that a great circle is a circle 
of radius R on an R-sphere and, therefore, has center the center of the sphere. 
This discussion shows that all great circles on $3 are geodesics. To see that 
great circles are the only geodesics on $3, suppose a is a unit speed geodesic. 
Then aj, = a” — (a”-U)U = 0 by definition, so a” = (a”-U)U. But, on 
SR, we know that U(a(t)) = a(t)/R, so a” = #2(a”-a@)a. Also then, 


(a x U)! = = (a! x 0)! 


1 
= pla! xata’ x a’) 


=0 


since a” is parallel to a and a’ is parallel to itself. Then a’ x U df Visa 
constant vector with a: N = 0 since N is perpendicular to U = a/R. But 
this means that a lies in the plane having N as normal vector. Further, since 
U = a/R lies in the plane, from any a(t) we can get to the origin (0,0,0) and 
stay in the plane. Hence, the plane passes through (0, 0,0) as well. Since a also 
lies on S2, it is in the intersection of the sphere with a plane passing through 
the origin. Therefore, it is (a part of) a great circle. The following exercise 
provides another approach, in terms of the Frenet Formulas, to characterizing 
geodesics on the sphere. 
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Geodesic on a sphere 


FIGURE 5.1. 


EXERCISE 1.4. Show that a curve a on S% is a geodesic if and only if it is a great 


circle. Hints: (1) suppose a@ is a unit speed geodesic with a’ =Tanda”’ =T’=KN. 


(Note that 


2 
R: 


the first part of the formula for S holds in general.) (2) What does this imply about 


Kq and To? 


T/R, where S is the shape operator of S' 


Show S(T’) = kaT — TaB 


EXERCISE 1.5. Show that a geodesic on M, a, which is contained in a plane is also 


O and the binormal B is the plane’s normal. 


a line of curvature. Hint: To 


EXERCISE 1.6. Show that a curve a in a surface M is both a geodesic and a line 


of curvature in M if and only if @ lies in a plane P which is perpendicular to M 


everywhere along their intersection. Hint: Frenet Formulas. 


EXERCISE 1.7. Find the geodesics on the cylinder M/: + y*% = R?. Hints: (1) 


find aja, and 


tan: 


” 


U+a 
d?y 
‘at? * 


) 2 
and 


—R( 
d7u 
‘at? 


0. (3) What conditions are imposed on 


” 


(Ros u(t), Rsin u(t), bu(t)) and take a’’. (2) A unit 
du 
at 


(cos u, sin u, 0). Decompose a 


Write a(t) = x(u(t), u(t)) 
normal is U = 

wo 

tan ~— 


set a@ 


Geodesic on a cylinder 


FIGURE 5.2. 
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Another way to see that our notion of geodesic is correct is to consider the 
following special situation. Suppose M is a surface with a patch (near a given 
point) x having E = 1, F = 0 and G > 0. (In fact, such a patch near a 
point may always be found. It is called a geodesic polar coordinate patch. See 
Chapter 6.) For instance, a surface of revolution generated by a unit speed 
curve has E = 1, F = 0 and G = h(u)? > 0. Let a: [89,81] > M be a unit 
speed geodesic. Then the arclength of a is 


81 Si 
La) = [ ja'|ds = | lds = 8; — 89. 


so 80 
Now take another curve (3: [89,81] + M with G(s9) = a(so) and @(s1) = 
a(s;). Note that we can always take the same interval {s9, s,] by reparametriz- 
ing. Also, in an appropriate region, the implicit function theorem allows us 
to write, G(s) = x(s,g9(s)) for some function g of s. We then obtain 


ds dg 
! _—— — — 


=X, + Xv’ 


and, 


(A"(s)| = V Xu * Xu + 2x, °Xyg' + Xp Xgl" 
= \/E+Gg" 
= 1/1+ Gog" 


>1 


since G > 0. Hence, we have the following estimate for the length of (, 


(6) = [ \s'las 
=) V/1+ Gg’? ds 
80 
y lds 
80 


= 8 — 89 = D(a). 


Thus, geodesics minimize the distance between two points in (an appropriate 
neighborhood contained in) M. 
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EXERCISE 1.8. Take the points (0,1,0) and (0,1, uo) on the cylinder 2? + y? = 1 
and consider both the straight line and helical geodesics joining them. What goes 
wrong in the argument above since it is clear that the helical geodesic is not shortest 
length. Hint: here is a situation where the fact that a surface may not be completely 
covered by a single patch has a true geometric implication. 


5.2 THE GEODESIC EQUATIONS AND THE CLAIRAUT RELATION 


Now let’s try to get a handle on the calculation of geodesics in a more 
general way. In the following, we only consider orthogonal patches x(u, v) (i.e. 
F = xXy-Xy = 0). Let a be a geodesic in the patch x. Then a = x(u(t), v(t)) 
and a’ = x,u' + Xyv’ with 


" 2 ! ! 2 
a” = Xyyul + Xyy'w + xyul + Xyyul’ + Xu + xy". 


Using the formulas for xy, Xyy and x», obtained in Chapter 3, we obtain 


E 2 E G 2 
wo " uit Ui tt Sau 
Qa =Xy c + RY a a oF | 
E. 2 G 2 
" vot Uitte Uv Ys 
+x, [o ele Dae ae + 356 | 


+U ftw? + Q2mu'v! + nv"? 


where the first two terms give the tangential part of a’’. For a to be a geodesic 
then, it is both necessary and sufficient that the following geodesic equations 
are satisfied. 


te Pg Pog Ce 2g 
(Geodesic Equations) 2 E 2E 
Ey 12 Gu vi 72 
yl — ul 4 Se '' + Mv _Qg 
2G G 2G 


EXERCISE 2.1. Find the geodesics on the cylinder arty? =1 by using the geodesic 
equations. 


EXERCISE 2.2. Suppose that a curve a satisfies the first geodesic equation and is 
also constant speed (i.e. |a’|? = u?E+u"G= c). Differentiate the constant speed 
relation, replace u” by the first geodesic equation and show that you obtain the 
second geodesic equation. Therefore, the constant speed relation takes the place of 
the second geodesic equation in the case of a constant speed curve. Now, reverse 
your calculations to show that the two geodesic equations imply that the curve must 
be constant speed. This is a fact that is hidden by the geodesic equations, but which 
is essential. Hint: in the first part, be very careful with the chain rule. 
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The geodesic equations are a system of 24 order differential equations. 
Given initial data consisting of a point on M and a tangent vector at the 
point, the theory of ordinary differential equations guarantees the existence 
and uniqueness of a geodesic on M through the point and having velocity 
vector equal to the given tangent vector. We state this formally as 


Theorem 2.1. Let p = x(uo,vo) be a point on a surface M: x(u,v) and let 
v € 7T,M. Then there is a unique geodesic a: (—r,r) + M with a(0) = p 
and al (0) = v. 


Proof. We want to obtain a geodesic a(t) = x(u(t), u(t)) with the property 
that a/(0) = u’(0)xu(uo, vo) + v’(0)x,(uo, vo) = v. Since v is fixed, this 
gives prescribed values to u’(0) and v’(0). Together with the initial values 
u(0) = up and vu(0) = vo and by the basic existence and uniqueness theorems 
of differential equations, this is precisely enough information to determine a 
unique solution (in some interval about 0) to the geodesic equations. 0 


EXERCISE 2.3. The plane and the sphere have the property that all geodesics are 
plane curves. Show that this property characterizes the plane and the sphere. That 
is, show that if M is a surface such that every geodesic is a plane curve, then M is 
a part of a plane or a sphere. Hint: Use Exercise 1.5, Theorem 2.1 and Theorem 
3.5.1. 


Example 2.1: The Unit Sphere S?. Take the standard patch 
x(u, v) = (cos ucosv, sinucosv, sinv) 
and calculate E = cos? v, F =0 and G=1. The geodesic equations become, 
u’ —2Qtanvu'v’ =0 y" +sinvcosv wu. 


As it stands, this is a formidable system of nonlinear differential equations. In 
fact, it is hardly ever the case that the geodesic equations are solved directly. 
The sphere provides a situation where we can use a standard trick to determine 
geodesics. First, assume without loss of generality that a(t) = x(u(t), u(t)) is 
a unit speed geodesic. Then, besides having the geodesic equations, we also 
have a! = u'x,+v'xy, which leads to the unit speed relation 1 = Eu’ gee et ad 
On the unit sphere this is simply 1 = cos? vu’ ? + y!?. Solve the first geodesic 


equation as follows: 
al! 
/ —= i 2Qtanvv’ 
u 


Inu’ = —2Incosu+c 
Retin iC 


u = —-. 
cos? v 
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Now replace wu’ in the unit speed relation by cost tO get 


5 2 
l=—, cos? v + vu! 
cost y 
72 ce 
vi =l1-— 5 
cos? v 


; cos? v — c? 
v = 4/——-. 
cos? v 


Dividing u’ by v’ produces the separable differential equation 


du Cc 


dvs cosvVcos? v — c2 


which may be integrated (by making the substitutions w = Fos tanv and 
w = sin@ in steps 4 and 5 below respectively) to produce 


Cc 
u= {| — 
cos vy cos? v — c? 
csec? v 


V1 — c* sec? v 


csec? y 


> V1 — 2 — tan? v 
i dw 

V1—w? 
= [a 


seats ( ctanv ) id 
= cslin ————— . 
V1 —c? 


Therefore, we have 
sin(u —d) = A tanv 


= We may expand sin(u — d) = sinwcosd — sindcosu and 


where \ = Fo: 


find a common denominator cos v to get ' 


sin u cos v _ .cosucosy Asinv 
——— cosd — sind ————- — = 
cos uv cos uv cos Uv 


If we make the substitutions x = cosucosv, y = sinucosv, z = sinv and only 
consider the numerator, we get 


y cosad—axzsind—Az=0. 


Hence, the geodesic equations imply that a lies on a plane az + by + cz = 0 
through the origin. Just as in our previous discussion of the sphere, this means 
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that a@ is contained in the intersection of a plane through the origin with the 
sphere — and this is a great circle. 

Before we leave the sphere, let’s look at one other piece of information 
provided by the geodesic equations. Let ¢ denote the (smaller) angle between 
a’ and x, at any point along the unit speed curve a. Then (using the angle 
5 — @ between a’ and x,) we get 


sin @ = cos(— — 6) = —— = —= [(uxy t+ UXy)-Xy] =uVE =u cos. 
ng=c (5 ¢) [| [xu TR ut UXy) + Xu] 
But u’ = =S; from the geodesic equations, so we obtain 
; c 
s — ; 
me cos U 


This is a special case of what is known as Clairaut’s relation. We shall discuss 
this in general shortly. For the moment, let’s look at what the relation implies 
about the behavior of geodesics (i.e. great circles) on the sphere. Suppose a 
geodesic a starts out parallel to a latitude circle (i.e. a u-parameter curve) 
x(u, Uo). This means that a’(0) is parallel to x, (uo, vo), so the angle ¢ is 5. 
The Clairaut relation says that 1 = sin 5 = aan so C = cosvp. Along a we 
then have cos vsin ¢ = cos vp and, since | sin ¢| < 1, cos? v > cos? up. Because 
— 5 <u<G, this means cosv > cosvo. We then have the cases 


v<uo for O<v,u <5 
v>vo for —5 <u, vo <0. 


Since v represents latitude, these inequalities mean that the great circle a is 
pinched between the latitudes vg above and vp below the equator. 


Example 2.2: The Torus. Take the torus M with parametrization 
x(u,v) = ((R+rcosu)cosuv, (R+rcosu)sinv, rsinu). 


We know E = r?, F = 0 and G = (R+rcosu)?, so the geodesic equations 
are 


rsinu 
/ ‘og 
: v =0. 


(R+rcosu) , 12 
-~_—___—— sin — 2 y'y' = 
(R + rcos u) 


ul + uv’ =0 
The second equation is separable and we have v’ = Gara Again assum- 
ing @ is unit speed, we replace v’ in the unit speed relation by ar ciay? to 
get 


l= ry! + eee 
(R+rcosu)? 
ul 1 e 


~ VO (R+rcosu)?’ 
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Now we divide v’ by u’ to obtain 


dv erVR+rcosu 


du (R+rcosu)* — c? 
52 erJ/R+rcosu du 
J(R+rcosu)2—c2 


Unfortunately, unlike the sphere, we cannot integrate the righthand side ex- 
plicitly. Therefore, the Clairaut relation takes on extra importance since 
it helps us to visualize the path of a geodesic. For instance, the Clairaut 
relation for the torus is (R + rcosu)sing = c, where ¢ is the angle be- 
tween a’ and x,,. Suppose @ starts out parallel to the topmost parallel circle 
x($,v) = (Reosv, Rsinv, r). Then ¢ = § and the Clairaut relation gives 
R=c. Again since |sin¢g| < 1, we have R+rcosu > R all along a. This 
implies that cosu > 0 and, consequently, -} <u < 4. Hence, the geodesic is 
confined to the outside of the torus and, in fact, bounces between the topmost 
and the bottommost parallels. 


ee 


FIGURE 5.3. Geodesic on a torus 


What do these two examples have in common? For both the sphere and 
the torus, the patches are orthogonal and FE and G depend only on u or only 
on v. We say that an orthogonal patch x(u, v) is a Clairaut parametrization in 
u if £, =0 and G, = 0. The patch is Clairaut in v if E, = 0 and G, =0. Of 
course, as we have seen, the sphere is Clairaut in v and the torus is Clairaut 
in u. The geodesic equations simplify in these cases to 


(Clairaut in u) 


(Clairaut in v) 
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In the following we shall focus on u-Clairaut parametrizations, but every- 
thing we say also applies to the v-Clairaut case as well. Take a u-parameter 
curve x(u, vo) for a u-Clairaut patch x and assume it is unit speed. In par- 
ticular E = x,-xX, = 1. Then, clearly, x(u, vo) satisfies the Clairaut geo- 
desic equations since v = vp and FE, = 0. Hence, for a u-Clairaut patch, 
u-parameter curves are geodesics. Of course this is a generalization of our 
previous discussion on surfaces of revolution. Also, note that we assumed the 
u-parameter curve is unit speed. While every curve can be reparametrized 
to have unit speed, it is not often the case that u-parameter curves naturally 
come with a unit speed parametrization. Some authors give the name pre- 
geodesic to curves which are not constant speed, but when reparametrized to 
have constant speed, become geodesics in the sense above. We shall simply 
continue to use the term geodesic for these curves. 


Since u-parameter curves of u-Clairaut patches are geodesics, it is natural 
to ask whether the same is true for v-parameter curves x(uo, v). The geodesic 
equations become 


where v’ = 1 and G, is evaluated at up. So, it is clear that the v-parameter 
curve x(uo,v) is a geodesic if and only if G,(uo) = 0. In particular, this 
applies to surfaces of revolution where G(u) = h(u)*. Putting this together 
with the discussion above gives 


Theorem 2.2. Let M: x(u,v) be a surface with u-Clairaut patch x. Then 
every u-parameter curve is a geodesic and a v-parameter curve with u = ug 
is a geodesic precisely when G,,(ug) = 0. 


Corollary 2.3. For a surface of revolution having parametrization x(u, v) = 
(g(u), h(uw) cos v, h(u)sinv), any meridian is a geodesic and a parallel is a 
geodesic precisely when h'(uo) = 0. 


EXERCISE 2.4. Give an example of a function h(u) so that the surface of revolution 
parametrized by x(u,v) = (u, h(u)cosv, h(u) sin v) has a geodesic parallel at u = 
uo, but up is not a maximum or minimum for h(u). 


In general, for a u-Clairaut parametrization x(u,v) and a unit speed geo- 
desic a, we can reduce the second geodesic equation quite easily to first order. 
This follows just as for the examples of the sphere and torus above. The 
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equation v” + Say! v' = 0 becomes 


" 
v_ / 


a Oils, 
v+=sSsSG 
yl! Cis 


Inve’ =—-InG+ec 


Again, as in the examples, v’ in the unit speed relation may be replaced by 


c/G to give 
1 = Eu’ + Gu” 
2 
2 Cc 
l= Eu" +G a 
2 
1= Eu? +5 
Eu +G 
2 G-e 
~ EG 
G-c? 
—_ 
U + EG 


EXERCISE 2.5. Differentiate either of the last two equations and show that you 
obtain the first geodesic equation. Therefore, again, the unit speed relation takes 
the place of this geodesic equation in the case of a Clairaut parametrization. 


Now if we divide uv’ by u’, we obtain a single integral which serves to charac- 
terize geodesics for a u-Clairaut parametrization. 


dv ou! 
dus ww 
& 
3.28 
G—c? 
+ EG 


vot ae eee 
_ /GVG=e2 
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EXERCISE 2.6. Suppose a(t) = x(u(t), v(t)) is a unit speed geodesic and x is u- 
Clairaut. Show that the Clairaut relation /G sin @ = c holds, where c is a constant 
and @¢ is the angle from x, to a’. Hence, show that a cannot leave the region of the 
surface for which G > c?. 


EXERCISE 2.7. Show that geodesics in the plane are straight lines. Hint: Use polar 
coordinates x(u, v) = (wcosv, usin v). 


EXERCISE 2.8. Let M denote the cone with parametrization x(u,v) = (ucosv, 
usinv,au). Show that a unit speed geodesic a(t) = x(u(t), v(t)) on the cone is 
. e * Vv os . 
characterized by the equation u = c sec ( Fat + D). For a = 1, determine c and 
D for the geodesic connecting (1,0,1) and (0,1,1) and compare the arclength of 
this geodesic between these points to the arclength of the parallel circle joining the 


points. Hint: MAPLE. 


FIGURE 5.4. Geodesic on a cone 


EXERCISE 2.9. Figure 5.5 shows a geodesic on the surface of revolution obtained 
from the Witch of Agnesi. The geodesic is parallel to the parallel circle with u = 7/3. 
Explain the behavior of the geodesic in terms of the Clairaut relation. 
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FIGURE 5.5. Geodesic on the whirling Witch of Agnesi 
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Before we go on, let’s look at the Clairaut relation from a physical view- 
point. Consider a surface M and suppose a particle is constrained to move 
on M, but is otherwise free of external forces. D’Alembert’s Principle in Me- 
chanics (see [Arn]) states that the constraint force F' is normal to the surface, 
so Newton’s Law becomes |F'|U = ma”, where |F| denotes the magnitude 
of F, U is M’s unit normal and a(t) is the motion curve of the particle. 
Taking m = 1, Newton’s Law tells us that the x, and x, components of 
acceleration vanish and, as we have seen, this leads to the geodesic equa- 
tions. Hence, a freely moving particle constrained to move on a surface moves 
along geodesics. Now suppose M is a surface of revolution parametrized by 
x(u,v) = (h(u)cosv, h(u)sinv, g(u)). Along the surface, the radial vector 
r from the origin is simply r = (h(u) cosv, h(u) sin v, g(u)) and the momen- 
tum vector of the particle’s trajectory is given by p = a’ = ux, + v'x, (since 
m = 1). Note that, since a is a geodesic, we may assume its speed is constant. 
The angular momentum of the particle about the origin may be calculated to 


be 


L=rxp 
= ((9'h ~— gh’)u’ sinv — ghv' cosv, —(g/h — gh')u' cosv — ghv' sinv, hv’). 


Because no external forces (or, more appropriately, torques) act on the par- 
ticle, angular momentum, and in particular its z-component, is conserved. 
Therefore, we see that h?v’ = C, where C is a constant. Further, if @ is the 
angle from x, to a’, then 


la’|hsing =a" - xy 
= (u'Xy + U'Xy) + Xy 
=p Ff 
=C 


since cos(5 — ¢) = sing and Xy-Xy = h?. Because |a’| is constant as well, we 
have 
h sin @ = constant. 


But this is precisely the Clairaut relation since h = VG. In this way the 
Clairaut relation is a physical phenomenon as well as a mathematical one. 


EXERCISE 2.10. Let M denote the elliptic paraboloid z = x? + y* parametrized as 
a surface of revolution by x(u,v) = (ucosv, usin v,u) and let a = x(u(t), v(t)) be 
a unit speed geodesic. Find v in terms of u and show that the Clairaut relation is 
usin @ = c. Show that a nonmeridan geodesic spirals up the paraboloid and crosses 
every meridian an infinite number of times. Hints: (1) use the Clairaut relation (2) 
the integral for v diverges, so a can’t approach a meridian as a limit. 
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FIGURE 5.6. Geodesics on a paraboloid 


EXERCISE 2.11. For the paraboloid M and geodesic a above, determine how low on 
M the geodesic a can go in terms of the constant c in the Clairaut relation. What 
happens to the geodesic when it reaches its lowest height? From this, show that a 
nonmeridian geodesic intersects itself an infinite number of times. Hints: You may 
use the following result which we will not prove. Theorem [DoCl, section 4-7, p. 
302]. If a geodesic approaches a parallel as a limit, then the parallel is a geodesic. 
What parallels are geodesics on the paraboloid? 


EXERCISE 2.12. Determine v as an integral for a geodesic on the catenoid. Show 
that the Clairaut relation cosh usin ¢@ = c determines the behavior of geodesics ap- 
proaching the parallel circle with up = 0. Hint: take a geodesic starting at x(uo, vo) 
with @ = ¢o and consider the three cases cosh ug sin go = c < 1, coshugsingg = 
c= 1 and coshug singg = c > 1. 


EXERCISE 2.13. Determine v as an integral for a geodesic on the hyperboloid of one 
sheet x? + y? — z? = 1 and give the Clairaut relation. Show directly that the ruling 
of the hyperboloid which passes through (1, 0,0) satisfies the Clairaut relation. Find 
a closed geodesic on the hyperboloid. Hints: (1) the ruling may be parametrized 
by a(v) = (1,tanv,tanv) (why?) and a parallel circle may be parametrized by 
B(v) = (cosh ug cos v, cosh ug sin v, sinh ug). What happens on the central parallel 
(cosv,sinv,0)? (2) A geodesic a: [a,b] — M is closed if a(a) = a(b). When are 
parallels geodesics? 


5.3 A BRIEF DIGRESSION ON COMPLETENESS 


In the discussion above, at various points, we have implicitly assumed that 
geodesics “run forever.” That is, we have taken for granted that a geodesic 
is a unit speed curve a: R — M whose domain is the real numbers. That 


166 5. Geodesics, Metrics, and Isometries 


this is not always the case is apparent from looking at an example such as 
the plane minus the origin, M = R* — {(0,0)}. We know that geodesics in 
the plane are straight lines, so the same must be true for M since the same 
geodesic equations hold. Suppose a geodesic starts at (r,s) in the direction 
(—r, —s). We know by Theorem 2.1 that there is a unique geodesic of this type 
which heads toward (0,0). Since the geodesic is unit speed, its arclength (i.e. 
distance travelled) corresponds to the length of the interval [a,b] on which 
it is defined. Because the geodesic cannot pass through the missing origin, 
it can only go a distance of less than Vr? + s*. Therefore, the interval [a, b| 
cannot be all of R. What goes wrong here? 

In order to answer this question, let’s make a definition. Say that a surface 
M is geodesically complete if every (unit speed) geodesic has domain R. The 
importance of this definition is reflected in the following beautiful result which 
we will not prove. 


Theorem 3.1 (Hopf-Rinow). If M is geodesically complete, then any two 
points of M may be joined by a geodesic which has the shortest length of any 
curve between the two points. 


When we speak of being a resident of some surface M, we naturally assume 
that we can get from any point to any other by some shortest-distance path. 
If we live in a nongeodesically complete surface however, this is just not so. 
In M = R? — {(0,0)}, we cannot go from (—1,0) to (1,0), say, by a geodesic. 
Indeed, there is no shortest path between these points. So, the question now 
becomes, how do we recognize geodesically complete surfaces? The example 
of the plane minus the origin gives a clue. Recall that a subset M C R? is 
closed if any convergent sequence z; — z with z; € M also hasze M. A 
sufficient condition for a surface to be geodesically complete is given by 


Theorem 3.2. A closed surface M C R° is geodesically complete. 


Proof. Let a be a unit speed geodesic on M. If a(s) is defined, then the 
fact that a arises as a solution to the geodesic equations guarantees that a is 
defined on an open interval (s — r,s +r), for some r. This means that every 
s € R for which a is defined has a small interval about it of other points for 
which a is defined. Hence, the subset of R consisting of points for which a is 
defined is an open set — that is, a union of open intervals. Let J = (a,b) be 
one of these intervals where we assume 6 < o0 and b is a least upper bound 
for J. We shall show that a is, in fact, defined for b as well. Hence, J has no 
least upper bound (i.e. b = oo). A similar argument for a then shows that a 
is defined on the whole real line. 

Let z; — b be asequence of points in J converging to the least upper bound 
b. Since the sequence converges, it is also Cauchy. That is, given « > 0, there 
exists an integer N such that for all n,m > N, |zn — 2m| < €. Now choose 
€ > 0 and look at a(z,) and a(zm). Since a is unit speed, its arclength from 
Q(Zn) to a(Zm) is simply |z, — z2m| < e. Also, as we saw in Chapter 1, the 
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(shortest) distance between two points p, q € R® is given by the line joining 
them and is |g — p|. Hence we have, 


la(Zn) — a(2m)| < |2Zn — 2m| < €. 


This means that the sequence {a(z;)} is Cauchy as well. Now, it is a fact 
of analysis that Euclidean space R” for any n is complete in the sense that 
every Cauchy sequence converges, so we have a(z;) — w for some w € R°. 
But a(z;) € M for all j and M is closed by hypothesis, so w € M. Hence, 
we may define a(b) = w, showing that a may be defined at b as well and 
contradicting b’s definition. We note that we have only shown that a may 
be extended continuously to b. Some rather technical arguments indeed show 
that a extends as a smooth geodesic. O 


EXERCISE 3.1. Which of the surfaces we have studied up to now are geodesically 
complete? 


5.4 SuRFACES Not IN R® 


Up to this point we have taken surfaces in 3-space together with the in- 
ner product structure naturally associated to any Euclidean space, the dot 
product. But there are other inner products which are available. Indeed, any 
nonsingular symmetric n x n-matrix A gives an inner product on R” by taking 
the matrix multiplication 


(x,y) =x’Ay 


for x, y € R”. Here x’ denotes the transpose of the column vector x to a row 
vector. The dot product is simply the inner product which takes A to be the 
identity matrix. In some sense, the geometry which we see on surfaces in 3- 
space reflects the geometry of 3-space itself. If we wish to extend our notion of 
geometry beyond 3-space we must somehow rid ourselves of our dependence on 
the dot product — or, at least, modify this dependence. Recall the definition 
of the dot product in R?. (We use vectors in R* because tangent planes of 
surfaces are 2-dimensional.) Let x = (21,22), y = (y1, y2) € R® and define 


1 0 
x-‘y= (21, 2) F 2) = 21Y1 + L2ye2. 


Now, change the matrix to ce | for a > 0 and find 


1/a 0 | i _ Tiyi + 22ye 


xoy = (21,22) 0 1/a Yo . 
a ey 
7 a 
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where “-” is the dot product and “o” is the modified dot product. All the 
usual properties of the dot product (such as symmetry, bilinearity etc.) work 
for the modified version as well. For an inner product on the tangent planes 
of a surface M the number a may change at each point p € M. That is, we 
may take a to be a function on M, 


a=f(p)? for f:M—>RandpeM. 


We take the square of the function f(p) to ensure that a is positive at each 
point. Thus, for two tangent vectors v,w € FT,(M) we have, 


fp)? 


We say that - and o are metrics of M since they allow us to calculate E, F, 
and G, the basic components of distance along M. The dot product is usually 
called the Euclidean Metric and a metric such as © is said to be conformal (to 
the Euclidean metric) with scaling factor f. 

Now, if these surfaces do not use the Euclidean metric, then where are 
they? While they may sit in R® as sets, their different metrics show that they 
don’t inherit geometry from R%. So, as surfaces with a metric, they do not 
sit inside R?! But if we cannot use the structure of R? — for example, the 
unit normal — how can we understand the geometry of such a surface? As 
is typical in mathematics, when we extend beyond our usual situation, we 
use previous theorems as definitions. Because we no longer can use the unit 
normal U to define Gaussian curvature K, instead we invoke Theorem 3.4.1 
to define kK. Note that this only makes sense because Theorem 3.4.1 ensures 
that K depends only on the metric. Therefore, 


VOW = 


Important Definition 4.1. For a surface with orthogonal metric (i.e. F = 
0), the Gaussian curvature K is defined to be 


«= 5798 (a (Jeg) * au (Zea): 


Of course, because this formula was a theorem for surfaces in R°, this defini- 
tion agrees with our original definition in case M has a metric induced by the 
Euclidean metric of R°. 


EXERCISE 4.1. For a conformal metric with scaling factor f, show that 
K = f(fuu + fov) - (fe — fo): 


A function f is harmonic if fuu + fuv = 0. In this case, clearly, K < 0 and K =0 
only at critical points of f. Define a conformal metric on R? ~ {(0,0)} by taking 
:e— 5 In(u? +v”). Compute K and draw the curves in the plane where K is constant. 
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As the following examples show, we can still define surfaces by taking 
patches x in R*, but with a conformal metric rather than the induced metric. 
We shall see that we can do geometry in this situation just as before. Under- 
standing this abstract differential geometry is the first step to understanding 
the geometry of higher dimensions. 


Example 4.2: The Poincaré Plane P. 
Define P to be the upper half-plane P = {(z,y) € R? | y > 0} with the patch 
x(u,v) = (u,v) and with the conformal metric, 


Wi: We 


3 where Wi, W2 €7,(P) and p= (u,v). 


W1°0W2 => 


This definition of the metric means that the usual dot product is scaled down 
by the height of p (i.e. v) above the z-axis. Let’s compute FE, F and G (at 
p). We obtain 


x, = (1,0) Xy = (0,1) 


1 
B=Xy'Xu =~ Fo=Xy> Xp = 0 G=Xy'Xy=—7. 
v Vv 


Note that G, = 0 and E, = =. Hence 


Ke-—L_{2 { 22/t 
2/1 /ut \ Ou \ /1/v4 


Thus, P has constant curvature equal to ~—1 at each point. P is then a 
“negative curvature” analogue of the unit sphere in R°. 


EXERCISE 4.2. Define M to be the upper half-plane M = {(z,y) € R? | y > 0} 
with the patch x(u, v) = (u,v) and with the conformal metric, 


where wi, w2 € Tp(P) and p= (u,v). 


wi'w 
Wi1°0W2 => d : 
UV 


Calculate the Gaussian curvature K of M. 
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Example 4.3: The Hyperbolic Plane H. 

Define H to be the disk of radius 2 in the plane minus the bounding circle 
with patch x(u,v) = (ucosv,usinv) O<u< 2, 0<v < 2m and conformal 
metric, 


W1° Woe 


a u2/4e where w1,w2 €7,(H) and p= (ucosv, usinv). 


W10W2 = 


EXERCISE 4.3. Show that K = —1 at each point. 


Example 4.4: The Stereographic Sphere S%,. 

Let $2, denote the unit sphere minus the North pole S? — {(0,0,1)} and 
define a map St: S%, — R? (where R? is the zy-plane) by taking the point 
in R? which is the intersection of R? and the line in R* determined by a 
point on the sphere and the North pole N. Formally, given a point p = 
(cos ucos v, sin ucosv, sin v) on 9%,, the line joining p and N is given by 


¥(t) = (0,0,1) + t (cos ucos v, sin ucos v, sin v — 1). 


The line + intersects R? when the third coordinate is zero. This occurs when 
14+ ¢(sinv —1) =0 or t = 1/(1—sinv). Hence, 


cosucosv sinucosv ) 


St(cosucosv, sinucosv, sinv) = =, 
1—sinv 


1—sinv 


Clearly, St is a one-to-one onto map from S?, to R?. Recall that the induced 
linear transformation of tangent vectors St, may be calculated by simply 
taking appropriate derivatives of the image u and v curves. For example, 
fixing a particular v and differentiating gives 


d f{cosucosyv sinucosy 
St. (Xu) ( ) 


du \ 1—sinv’ 1-—sinv 
—sinucosv cosucosu 
l—sinv ’ 1-—sinv’ ; 


Similarly, differentiating with respect to v gives 


62 (<=. _sinu_ 0) | 


1—sinv’ 1—sinv’ 
We may now define a new metric on S%, by saying 
W) © Wo = St.(w1) - St. (we) 


where - denotes the sphere’s induced R° metric. 
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EXERCISE 4.4. In the metric o on S%, show 


2 
cos’ v P25 C= 1] 


Beas te Ae 
(1 — sin v)? (1 — sin v)? 


and calculate the Gaussian curvature K. Explain your answer. 


Example 4.5: The Flat Torus That. 
Let Taat be defined by the patch with range R?+, 


x(u,v) = (cosu,sinu,cosv,sinv) O<u< 2a, O<v< 2rn. 


and with the induced metric (i.e. dot product) of R*. Just as before, we 
may calculate x, and x, componentwise to gett EF = 1, F =O and G = 1. 
Hence, K = 0 and That is flat. Note that this example does not come from a 
conformal metric, but rather from a Euclidean metric of a higher dimension. 


EXERCISE 4.5. Verify the calculations for the flat torus Tg,; and then show that 
the name is apt by defining a one-to-one onto map from Taz to a usual torus. Since 
the flat torus is closed and bounded in Rt, it is compact. Can the flat torus be 
embedded as a surface in R® (ie. That C R® with the induced metric of R°)? 
Explain. 


Beyond having a definition of Gaussian curvature, however, we would like 
to understand the nature of these surfaces in terms of their geodesics. Again 
we are faced with the problem of removing all traces of the unit normal U from 
our previous discussions of geodesics. One way is easy. When we calculated 
the Christoffel symbols for xy, Xuy and Xy,, we never used U. Hence, the 
same formulas hold for surfaces not in R®. In particular, the calculation of a” 
remains the same, but without the final term involving U. That is, 


E. 2 E. Gu 
a” = Xy uy + ay + ayy’ =. stu] 
E 2 G Gy 2 
+ Xy G aa oC u! + a u'v’ += t 


For a to be a geodesic then, just as before, it is both necessary and sufficient 
that the geodesic equations are satisfied. Now let’s look at the original way 
we defined geodesics in terms of geodesic curvature. Recall that, for a unit 
speed curve a, we had 


a" =KgT xU+(a"-U)U. 


Of course, we do not have a U now, so we must drop off the last term and 
somehow make sense of the first. The tangent vector T x U was used before 
to obtain an orthonormal basis for R°, {T, T x U, U}, where T and T x U 
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provided an orthonormal basis for the tangent plane at every point of the 
surface. Although we do not have a unit normal to work with for surfaces not 
in R°, we may still find an appropriate tangent vector in each tangent plane 
which is perpendicular to T. The way to do this is as follows. Given a patch 
x(u, v), define a linear transformation J of each tangent plane by 


1(35) _ Xe 1(=5) Xu 
VE] VG VG VE 
Note that defiriny J on a basis for each tangent plane suffices to define J 
completely. Aisu note that J is defined on unit vectors and has unit vectors 


as outputs. This is to ensure that J has no effect on lengths, but, rather, only 
rotates vectors on each tangent plane. 


EXERCISE 4.6. Suppose a: J > M is a curve on a surface M in R? (i.e. with R°'s 
induced metric). For convenience, take a to have unit speed. Recall that patches 
ate HOw Besured to be orthogonal; that is, F = 0. Show that J(T) = T x U. Hint: 
a =u'xu+v'xy and J is linear. 


Since J(T) = T x U for surfaces in R?, we can take the equation 
peated L.) 


as the definition of geodesic curvature for a unit speed curve a(t). Of course 
the nonunit speed case is just as before, but without the U term. Now, 


J(T) = J(u’xy + v'Xy) 
ae aa 


w/a —x, — ve —Xy 
=H 10! fo xut tg yf aX. 


If we equate the x,, coefficient of this expression for a’’ and the one we used 
to define geodesic equations, we have 


6) 


—U EB Eu, ul Ey n Gu t 
Kg => aera Ae pe age “Ut = Uv. 
v G 2EG v VEG 2VEG 
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We may write all this in a much more meaningful way. Let 6 denote the angle 
between a’ and x,,. Since a is unit speed, we have 


a’ = ux, + v'x, 


Xy x 
= cos§@ —= +sin@ —2 


VE VG 


Therefore, u’/E = cos6 and v'/V/G = sin#. If we differentiate the first ex- 
pression and substitute in the second, we obtain 


ul VE + yee tb _ dcosé 


WE dt 


wiVB 4 ewut E, u'v' dé 


“ = — sin§ — 
WE AWE ee dt 


E,w? — E,u'v’ dé 
a! VE 44+ =-VGv' — 
WE 2/E dt 


—u" [E  Eyw?® — Eyw _ dé 
vu VG wVEG wWEG dt 
Comparing this with the expression for k, above gives the following 


Theorem 4.1. Let a(t) be a unit speed curve in a surface M: x(u,v) and 
let 6 denote the angle between a’ and x,. Then the geodesic curvature of a 
is given by 
dé 1 
kg = IG,v’ — E,w’). 


dt 2/EG 


Note that we have not specified a range for the patch x(u,v). This is simply 
to indicate that the expression for Ky holds for surfaces not in R® as well as 
those in R?. We shall meet up with geodesic curvature again in Chapter 6 
where we shall derive the above formula for k, in a different way. 

Now let’s find some geodesics. Our previous approach in terms of the 
geodesic equations still works here of course. In particular, the notions of 
u and v-Clairaut still have meaning and still simplify the geodesic equations 
greatly. Note, however, that the property of being a Clairaut patch is much 
more dependent on the definition of the metric now than was the case when 
we could safely rely on R?’s dot product. 
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Example 4.6: The Poincaré plane P. 
The patch x(u, v) = (u,v) with the conformal metric, 


Wi'w 
W 1 °OW2 = aT ae where wi, W2 €7,(P) and p= (u,v) 


is v-Clairaut, so, for a = x(u(t), v(t)), the geodesic equations give 


Since the patch is v-Clairaut, the usual procedure for calculating geodesics 
gives, from the first geodesic equation, 


In(u’) = 2In(v) +c 
u’ =cv". 


Plugging this into the unit speed relation 1 = u’?(1/v?) +. v’7(1/v?) then gives 
v' = vv 1 — c*v*. Dividing v’ by wu’ and integrating produces 


fu- {See 
| Jl — ev? 


1 1 
u-d=< | sinwdw where v = — sin w 
Cc Cc 


II 
—_ 
| 
Q 
NO 
e 
i) 


This is the equation of a circle centered on the u-axis. Also, since the patch is 
v-Clairaut, vertical lines (i.e. v-parameter curves) are geodesics as well. So, 
the Poincaré plane P has as its geodesics arcs of circles centered on the u-axis 
and vertical lines. These are the “straight lines” of P. 
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FIGURE 5.7. Geodesic on the Poincaré plane 


EXERCISE 4.7. Show that the parallel postulate does not hold for the Poincaré 
plane. The parallel postulate says that, given a line and a point not on the line, 
there is precisely one line through the point which is parallel to the given line. The 
Poincaré plane is a model for the non-Euclidean geometry of Gauss, Lobachevsky 
and Bolyai. 


EXERCISE 4.8. Find the geodesics for the surface M of Exercise 4.2. 


Example 4.7: The Hyperbolic Plane H. 

We compute geodesics as follows: (1) Since x is u-Clairaut, u-parameter curves 
are geodesics. Hence, radial lines through (0,0) are geodesics. For non-u- 
parameter geodesics, apply the Clairaut integral formula (after simplification) 


to get 
1 —u?/4) 
= (2 (tau /4) 
(< et /4) | 


This appears formidable, but it can be handled as follows. 


c 2 dw 
EXERCISE 4.9. Let w = ——=——== (1+ u*/4} and show dv = ————. 
Ween | /4) V1 —w? 
Then, { dv = sa and v—vp = cos! w. We then write cos(v— vp) = 
| Vi-w : | ° 


WwW Or 


cos(v — Up) = 1+u?/4) 


c 
uv1+4+c? ( 
cos(v — vp) = 44+ u? 


4uvV1 4+ c? 
c 
4uV1+ c? 


ue +4 — ————— ena(v — vp) = 0. 
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To interpret this equation, replace polar coordinates (u,v) by rectangular co- 
ordinates (x,y), = ucosv and y = usinv, expand cos(v—vp) to cos v cos vg + 
sin v sin Up and complete squares. We have, 


4/1+c? 4/1 +c? 


uw +4— =< COR Uo 4 C08? - ——— — Bin Uo usinv = 0 
4/1+c? 4/1+c? 
ge+y*+4— cos 09 2 - AT sin vy y =0 
2V1+c? 2/1+4+c? A(1+c? 
(x — EE ees? +(y- ve ne)? case) +4=0 
Cc c c? 
2V1+c? 


2V1+c? , »o 4 
—— — sin vo) me 


(x — es cos U9)” + (y — 
This equation represents a circle centered outside H (since avltet >2): 
Moreover, this circle intersects the boundary circle of H, x? + y? = 4, at 
right angles. Therefore, the geodesics of the hyperbolic plane are radial lines 
through the origin and arcs of circles centered outside radius 2 which meet H 
orthogonally. 


EXERCISE 4.10. Show that the circle above meets x? + y? = 4 orthogonally. Hint: 
(2vite? Ng = 2? +4 (2)? 
c c : 


Note, again, that, given a geodesic and a point off it, there are an infi- 
nite number of parallel geodesics through the point. The hyperbolic plane 
is also a model for the Gauss-Lobachevsky-Bolyai non-Euclidean geometry. 
Indeed, the hyperbolic plane is isometric to the Poincaré plane. That is, from 
the viewpoint of differential geometry they are indistinguishable. We shall 
consider this more extensively shortly. 


EXERCISE 4.11. Find the geodesics on the stereographic sphere $%,. Hints: (1) the 
patch is v-Clairaut, (2) a convenient substitution may be cosv/(1—sinv) =c sec@, 
(3) just as for the ordinary sphere, geodesics lie on certain kinds of planes — planes 
which always pass through a particular point of the sphere. 


EXERCISE 4.12. Let x(r,@) = (rcos@,rsin@) be the polar coordinate patch for the 
xy-plane. Show that the inverse to stereographic projection is given by 


Q9rcos@ 2rsin@ r*—1 
T+r2? 1+r2? 14r2/° 


St 1 (rcos 6, rsin 9) = ( 


Define a metric on the plane by taking w1 0 w2 = Sty 1(w1) - Sty 1 (wo) where - 
stands for the usual metric on the sphere. This surface is called the Stereographic 
Plane. Show that 

4 Ar? 


aa eae oe oS ages 
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and K = +1. Find the geodesics of the stereographic plane by imitating the 
approach for the hyperbolic plane. In particular, the substitution w = c(1 — 
r?)/2r/1 —c? may be useful. The unit circle and the lines through the origin 
are geodesics. Why? Finally, show that all geodesics intersect the unit circle, the 
angle of intersection 8 obeys cos@ = c, where c is the Clairaut constant and the 
formula for a geodesic becomes (x + tan 3 cos 69)” + (y + tan @ sin 69)? = sec? @. 


5.5 ISOMETRIES AND CONFORMAL MAPS 


Suppose we have two surfaces M: x(u,v) and N: y(r,s) given by the in- 
dicated patches. If there are smooth functions r and s from the domain of 
x to the domain of y, then we may define a map of surfaces J: M — N by 
I(x(u,v)) = y(r(u, v), s(u, v)). We say that I is a (local) isometry if 


Ey = Xy Ox Xy = I,.(Xu) Oy Euixy), Gx = Xy Ox Xy = 1.065) Oy VPC 


Here, as usual, we assume that orthogonal patches are given, so we implicitly 
assume that 0 = Fy = I,(xu) oy I4(x,). By the chain rule, we know that 
I, (Xu) = Yrfut Ys Su and I,(Xy) = Yr Ty + Ys Sy, SO we get a very explicit 
check on whether or not a mapping is an isometry. Namely, we must have 


= ee 2 2 = 3 2 2 
Ey = Xy % Xy = Eyrt, + Gysy Gy = Xy Oy Xy = Eyry + Gysy.- 


Of course this is what we obtained in Exercise 3.2.2 for the special case of two 
parametrizations of the same surface (with J the identity mapping). Moreover, 
the very same calculations (restricted to our situation where F' = 0) give 
EX.Gx = [ruSy — TvSu)EyGy, so that plugging into the formula which defines 
K (and much tedious computation) produces the fundamental 


Theorem 5.1. If I: M — N is an isometry, then the Gaussian curvatures 
at corresponding points are equal. That is, Kw(p) = Kn(I(p)) for allpe M. 


EXERCISE 5.1. In Exercise 3.4.1 it was shown that the surfaces 
x(u, v) = (ucos v, usin v, v) y(r,s) = (rcoss,rsin s,lnr) 


have the same formula for Gaussian curvature, K = 1/(1 + w*)* where w stands 
for u and r respectively. Show that these surfaces cannot be isometric however. 
Hint: an isometry would require r(u,v) = u (up to sign). What goes wrong with 
Ex = Eyr2 + Gys?? 


178 5. Geodesics, Metrics, and Isometries 


FIGURE 5.8. Nonisometric surfaces with the same K 


If the parameter domains are the same and the functions r and s are the 
projections r(u,v) = u and s(u,v) = v, then an isometry simply means that 
the metric is preserved 


E,x=Ey Fe=Fy G,=Gy 


In fact, many authors say that the functions r(u,v) and s(u,v) give a new pa- 
rametrization y(u, v) = y(r(u, v), s(u, v)) of the surface N, so that a mapping 
may be defined by I(x(u,v)) = ¥(u,v) and the question of whether J is an 
isometry is reduced to the simpler criterion above. Of course, the particular 
mapping must be checked to be a parametrization. In order to avoid this and 
to make the examples and exercises below more transparent, we have chosen 
to make the functions r and s explicit. 

Clearly, an isometry preserves the lengths of all tangent vectors (since each 
is a linear combination of x, and x,). Indeed, the word “isometry” comes 
from the Greek for “same measurement.” The Inverse Function Theorem then 
says that each point of M has a small open neighborhood around it so that I 
has a smooth inverse function on that neighborhood. In the case where I is 
one-to-one, onto and has a smooth global inverse J-!: N — M, we say that 
I is a global isometry. Although our definition of “isometry” is not the most 
general one which can be given, it will suffice for all the examples in this book. 
Let’s examine some of these now. 


Example 5.1: The Helicoid. 

Let two surfaces M and N be defined by patches x(u, v) = (ucosv, usin v, v) 
and y(r,s) = (sinhrcoss,sinhrsins,s) respectively. The first patch we rec- 
ognize as the standard helicoid. Define a map I: M — N by I(x(u,v)) = 
y(sinh~' u,v). Now, r =sinh~! u and s = v, so the relevant nonzero partials 
are r, = 1/coshr = 1/V1+4u? and s, = 1. Then we calculate E, = 1 = 
(1/cosh* r) cosh? r = Eyr? and Gy, = 1+? = cosh?r = Gys?. Hence, 
I is an isometry. Clearly, J is one-to-one and onto with smooth inverse 
I~1(y(r, s)) = x(sinhr, s), so I is in fact a global isometry. 
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FIGURE 5.9. Unrolling a cone 


EXERCISE 5.2. Let z = ay/ x2 + y* be the cone with the parametrization x(u, v) = 
(ucosv,usinv, au). Let ¢ denote the vertex angle between the z-axis and the cone. 
Show that sing = 1//1+ a”. If we cut the cone along a ruling line, then we can 
unroll it onto the plane to get a pie wedge. Let the vertex angle of the pie wedge be 
8 and show that 6 = 27sin ¢. The unrolling process is given by y? (u,v) = 


STE v v — 
uv 1 + ta2 cos (=) , uV1+ta? sin (==) , auv 1 =7) 
( V1 + ta? 


V1+4 ta2 


for 0 < t < 1. Check that this map gives the unrolling above. Show that the map 
defined by fixing any t, [(ucosv,usinv,au) = 


—— v v 
uv 1 + ta? cos (==). uv 1+ ta? sin (+=). ouwvT=*) 
( V1 + ta? V1 + ta? 


is an isometry. Finally, show explicitly that J takes geodesics on the cone to geodesics 
(i.e. lines) in the plane. Hint: for the last part, a substitution w = AB/(A — 
Btan(v/V1+)) may be useful, where A = c/sinD, B = c/cosD and c and D 
come from the formula for geodesics on the cone. 


EXERCISE 5.3. Invert a cone with vertex angle ¢ so that it sits on the zy-plane. 
Throw a lasso around the cone with a loop of fixed length Z and pull down on the 
free end of the lasso to tighten the loop around the cone. What happens? More 
specifically, does the lasso slip or does it stay fixed? Does your answer depend on 
the vertex angle? Explain. Hint: (1) what kind of curve does the loop become? If 
the loop stabilizes, what are the only forces acting on it? (2) unroll the cone. 
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FIGURE 5.10. Lassoing a cone 


EXERCISE 5.4. Find an unrolling map for the cylinder x(u,v) = (u,cosv, sin v) 
and show that the geodesics on the cylinder go to straight lines in the plane under 
the isometry I defined by ¢ = 1. Hint: for 0 < t < 1, find the line segment joining 
(u, cos v, sin v) to (u, v, 0). 


The fact that the unrolling maps J for the cone and cylinder carry geodesics 
to geodesics is not a phenomenon special to these surfaces. In fact, 


Theorem 5.2. Let a(t) be a geodesic on a surface M and let I: M — N be 
an isometry. Then, the curve 3(t) = I(a(t)) is a geodesic of N. 


Proof. In case the functions r and s above are the projections, the proof 
is trivial. On the other hand, if r = r(u,v) and s = s(u,v) are general 
smooth functions, then the calculations are quite tedious. We shall take a 
middle road and prove the theorem for the case r = r(u) and s = s(v). 
This will give the flavor of the calculations involved, but avoid unneces- 
sary complications. With this in mind, take I(x(u,v)) = y(r(u), s(v)) and 
T{a(t)) = I{x(u(t), v(t))) = y(r(u(t)), s(v(t))). We wish to show that r(u(t)) 
and s(v(t)) satisfy the geodesic equations in y’s metric. Our assumption on 
r and s implies the following relationships between the metrics for x and y, 


om 2 = 2 = 2 = 2 
Ey = Eyry, Gy = Gys; Ey = Ext; Gy = Gxv5. 
Also, the chain rule gives 
2 W 
rear s=s)v uur vo =v,8 Hr yu bras 


Now we put r and s into the first geodesic expression and substitute the 
calculations above to get 
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since a satisfies M’s geodesic equations. (Note that factoring out 1/u, gave 
ry U, = 1 in the numerators of the middle terms.) Therefore, ( satisfies the 
geodesic equations for N. DO 


EXERCISE 5.5. Show that r and s satisfy the second geodesic equation also. 


EXERCISE 5.6. Use the standard polar coordinate patch x = (ucosv, usin v) for 
the hyperbolic plane H and define a map I: H — P by 


4u sin v 4—u? 
I(x(u, v)) = (gece ae ier eT TOT 0) = y(r(u, v), s(u, v)) 
where P is the Poincaré plane parametrized by y(r,s) = (r,s). Show that I is an 
isometry. The map [ is, in fact, a one-to-one onto map (with smooth inverse) to 
the upper half plane, so it is a global isometry. The formula above results from the 
Mobius (or linear fractional) transformation of complex variable theory, 
2-2 

T(z) = arty 
where z = 2 +%y. As we shall see in Chapter 7, complex dhalysis plays a large 
role in differential geometry. Here is a sample of the complex approach in terms of 
something we already know. Consider the hyperbolic plane H with standard polar 
coordinate u-Clairaut parametrization x(u,v) = (ucosv, usin v). Since radial lines 
through the origin are u-parameter curves, they are geodesics. Now, given any point 
p = (uo, vo) in the disk u = 2, there exists a linear fractional transformation of the 
form 
ja]? — |e|? = 1 


5z+4 
which, in real plane coordinates and with the hyperbolic metric, is an isometry of H 
and which carries the origin to p. Geodesics through p are then images of radial lines 
through the origin under T. But it is known that linear fractional transformations 
take circles and lines to circles and lines. Also, since T is an isometry, and since 
radial lines through the origin meet wu = 2 perpendicularly, so must the geodesics 
through p. Hence, geodesics through p must be circles which meet the boundary 
circle u = 2 at right angles. This is what we calculated earlier. 
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EXERCISE 5.7. Suppose we parametrize the helicoid by x(u,v) = (sinhrcoss, 
sinh r sin s,s). Smoothly bend the helicoid into the catenoid by defining (from t = 
tot = 7) 


t Nearer 
x (u,v) = (sintsinh ucos v — cost cosh usin v, 
sin tsinh usin v + cost cosh ucos v, 


ucost + vsint). 


Only at the last step is the bending not one-to-one. Show that, for every B<t<n, 
I(x(u,v)) = x'(u,v) is a global isometry and for t = 7, I is a local isometry. Also 
show that at each stage Gaussian curvature is preserved (since it is determined by 
the metric) and each intermediate surface is minimal. 


Notice that steregraphic projection from the ordinary (i.e. not the stereo- 
graphic) sphere S?\{N} to R? is not an isometry. Recall that, on tangent 
vectors, we have 


—sinucosv cosucosv 
l-sinv ’ 1l-sinv’ 


St, (x) = ( 
St.(X») = ( 


Hence, taking dot products in R°, we get 


COs U sin u 
1—sinv’ 1—sinv’ 


_ cos? y 

~ (1—sinv)? 

= ] 

~ (= sinv)2 ee 


Sts (Xu) - Sts (Xz) 


1 
Sts (Xv) - St. (xy) = (l—sinv)? 
= ! Xy°X 
~ (1=sinv)27” ~” 


Sts (xX) Ohi Xe) SO: 


While the factor 1/(1 — sin v) indicates that stretching is taking place and, 
therefore, St cannot be an isometry, it also shows that the stretching is uni- 
form and angles are preserved. Taking stereographic projection as our ba- 
sic example, let us weaken the isometry condition by requiring a mapping 
I: M — N to satisfy the following proportionality condition. For orthogonal 
patches x and y, say that I(x(u,v)) = y(r(u, v), s(u, v)) is a conformal map 
if 0 = I,(xx) oy I. (xy) and 


Ey, = X(u, v)? [Eyr? + Gys?| Gx = A(u, v)* [Eyr? + Gys?] 
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for a function A(u, v) called the scaling factor. Again note that, when r(u,v) = 
u and s(u,v) = v, J is conformal exactly when the metrics are proportional 
at each point p € M and its image I(p) € N, 


E, = X(u, v)? Ey Gy = u,v)? Ey. 


EXERCISE 5.8. Show that if J is conformal, then angles between tangent vectors 
are preserved. Hint: it is sufficient to show this for xy and xy. 


Note that stereographic projection is a conformal mapping from the sphere 
to the plane with the metrics induced by R°. This fact will prove to be 
essential for the complex analytic approach to minimal surfaces in Chapter 
7. Furthermore, we shall see in that chapter that the following exercise has a 
far-reaching generalization. 


EXERCISE 5.9. Recall that the Gauss map of a surface M: x(u,v) C R?® is 
denoted by G: M — S? and defined by G(x(u,v)) = U(u,v), where U is the 
unit normal of M. For the helicoid x(u,v) = (ucosv,usinv,v), the catenoid 
x(u, v) = (cosh ucosv, cosh usin v, u) and Enneper’s surface x(u,v) = (u — u2/3 + 
uv’, -—yt+v3 /3- vu*,u" — v’), show that the Gauss maps are conformal with re- 


spective scaling factors \(u,v) = 1+, cosh? u, (1+u? + v7)?/2. 


EXERCISE 5.10. Let x(u,v) = (Rcosucosv, Rsinucosv, Rsinv) denote the usual 
coordinate patch for the sphere of radius R. Define 


I(x(u, v)) = (u, In tan (5 + *) ,0) 
= y(r(u, v), s(u, v)) 


where y(r,s) = (r, 8,0) is a patch for a strip in the coordinate plane. Show that I 
is conformal with scaling factor (u,v) = Reosv. 

Suppose a = x(u, v(u)) is a curve on the sphere which intersects every longitude 
(i.e. a u-parameter curve) at a constant angle. Show that v(u) = Cu/ R? and show 
that such a curve has a straight line as its J-image. The map J is called the Mercator 
Projection of a sphere onto a rectangle. In particular, the Mercator projection may 
be used to construct a map of the Earth. The curve property we have mentioned 
allowed sailors in earlier times to plot courses on a map as straight lines and then 
steer the plotted courses by keeping constant compass headings (i.e. making a 
constant angle with longitudes). Although these courses were not geodesics, the 
ease in navigation made up for the greater distance. For a nice discussion of the 
mathematics of map making see [RW]. 


5.6 GEODESICS AND MAPLE 


It has always been hard for geometers to visualize geodesics. This is due to 
the fact that solutions of the geodesic equations must generally be obtained 
numerically and then, since the solutions are in terms of the parameters u and 
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v, plugged into the surface parametrization. In years past, this process would 
require a great deal of complicated programming to mix the numerical analysis 
with 3D-graphics. Today however, a short and simple MAPLE procedure can 
accomplish the same task. Geodesics may be graphed on their surfaces and 
may give just the right kind of intuitive information to allow rigorous analysis. 
For example, the picture Figure 5.6 provides the intuitive insight necessary to 
solving Exercise 2.10. Of course, it is a simple matter for MAPLE to generate 
the geodesic equations for a given parametrization. The following procedures 
do exactly this. First, we need the metric and all the preliminary procedures. 


> with(plots): 


> dp := proc(X,Y) 
> X(1]*Y(1]+X(2]*Y[2]+X[3]*Y [3]; 
> end: 


> nrm := proc(X) 
> —— sqrt(dp(X,X)); 


> end: 
> xp := proc(X,Y) 
> local a,b,c; 


> a := X[2]*Y(3]-X[3]*Y [2]; 
> b := X[3]*Y[1]-X[1]*Y[3]; 
> c := X[1]*Y[2]-X[2]*Y[1]; 
> [a,b,c]; 

> end: 


> Jacf := proc(X) 


= local Xu,Xv; 

> Xu := [diff(X[1],u),diff(X(2],u) diff(X[3],u)]; 
> Xv := [diff(X[1],v) ,diff(X[2],v) diff(X[3],v)]; 
> simplify([Xu,Xv]); 

> end: 


> EFG := proc(X) 

local E,F,G,Y; 

Y := Jacf(X); 

E = dp(¥/l], Yt) 
F := dp(Y[1],¥[2]); 
G := dp(Y[2],Y[2]); 
simplify((E,F,G]); 
end: 


VVVVV VV 
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Once we have the metric coefficients E, F and G, we can write the geodesic 
equations. As usual, for simplicity, we take orthogonal patches so that F' = 0. 


> geoeq:=proc(X) 

> local M,eq1,eq2; 

> M:=EFG(X); 

> eq1:=diff(u(t),t$2)+subs({u=u(t),v=v(t) }, 
diff(M[1],u) /(2*M[1]))*diff(u(t),t)~2 
+subs({u=u(t),v=v(t) } ,diff(M[1],v) /(M[1]))*diff(u(t) ,t)*diff(v(t),t) 
-subs({u=u(t),v=v(t) } diff(M[3],u) /(2*M[1])) «diff(v(t),t)~2=0; 

> eq2:=diff(v(t) ,t$2)-subs({u=u(t),v=v(t) }, 
diff(M[1],v)/(2*M[3]))*diff(u(t),t)~2 
+subs({u=u(t),v=v(t)},diff(M[3],u) /(M[3])) «diff(u(t),t)*diff(v(t),t) 
+ subs({u=u(t),v=v(t) },diff(M(3],v) /(2*M[3]))«diff(v(t),t)~2=0; 

> eql,eq2; 

> end: 


Finally, we come to the centerpiece of this section, a MAPLE procedure 
which plots geodesics on surfaces. This procedure has a bewildering number 
of inputs, but each serves a function. The first input is, of course, the pa- 
rametrization of the surface in question. The next four inputs give bounds 
for the parameters u and v of the surface. The inputs u0 and v0 give an 
initial point on the surface for the geodesic while Du0 and Dv0 give an initial 
direction. The input n is a vector whose first two components bound the ge- 
odesic parameter and whose third component makes the picture smoother as 
it increases (but also makes the rendering slower). The input “gr” is a vector 
which allows the grid size to be changed from its usual 25 by 25. Again, the 
larger the grid the better the picture and the slower the rendering. Finally, 
the inputs theta and phi orient the output picture. 


> plotgeo:=proc(X,ustart,uend,vstart,vend,u0,v0,Du0,Dv0,n,gr, theta,phi) 

> local sys,desys,dequ,deqv,listp,j,geo,plot X; 

> sys:=geoeq(X); 

> desys:=dsolve({sys,u(0)=u0,v(0)=v0,D(u)(0)=Du0,D(v)(0)=Dvo}, 
{u(t),v(t)}, type=numeric, output=listprocedure); 

dequ:=subs(desys,u(t)); deqv:=subs(desys,v(t)); 

listp:=[seq(subs({u=dequ(j/n[3]),v=deqv(j/n{3])},X), j=n[1]..n[2])]; 

geo:=spacecurve({listp}, color=black,thickness=2): 

plotX:=plot3d(X,u=ustart..uend,v=vstart..vend,grid= [gr(1],gr[2]]): 

display ({geo,plotX},style=wireframe,scaling=constrained, 
orientation=|theta,phi]); 


VVVVV 


end: 


V 
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Let’s consider the torus. The geodesic equations of Example 2.2 are found 
by the following. 


> Tor:=[(5-+cos(u))*cos(v),(5+cos(u))*sin(v),sin(u)]; 
> geoeq(Tor); 


d?u Fa aay dv \? d?y 2sinu dudv 
— inu{—j] = — 
dt? dt dt2 5+ cosu dt dt 


The geodesic of Figure 5.3 is then given by 
> plotgeo(Tor,0,2*Pi,0,2*Pi,Pi/2,0,0,1,{0,240,15],[20,20],0,60); 


Notice that we are starting the geodesic at (u,v) = (7/2,0) with direction 
(0,1) along the top parallel and we let the geodesic parameter run from 0 to 
240 with a smoothness coefficient of n = 15. The grid for the torus is 20 by 
20 and the picture is viewed from 6 = 0 and ¢ = 60 (in degrees). The picture 
gives a beautiful illustration of the Clairaut analysis in Example 2.2. Namely, 
by rotating the picture (in the viewing window) to look from the top down, 
we can see that the geodesic stays on the outside of the torus. 


EXERCISE 6.1. For the Whirling Witch of Agnesi of Exercise 2.9 and Figure 5.5, 
plot the geodesic referred to by the following. 


> Witch:=[2*tan(u),2*cos(u)~ 2*cos(v),2*cos(u)~ 2*sin(v)]; 
> plotgeo(Witch,-1.2,1.2,0,2*Pi,Pi/3,0,0,1,[0,500,15],[20,15],90,60); 


Can you see the effect of the Clairaut relation? 

EXERCISE 6.2. Plot geodesics for various initial directions and starting points on 
the sphere, cylinder, cone and paraboloid using 

> Sph:=[cos(u)*cos(v) ,cos(v) *sin(u),sin(v)]; 

> Cyl:=[cos(v),sin(v),ul; 

> Co:=[uxcos(v),u*sin(v),ul; 

> Par:=[ux*cos(v),u*sin(v),u~ 2]; 


Relate your pictures to examples in the book. In particular, relate the cone and 
paraboloid to Exercises 2.8 and 2.10 respectively. 


EXERCISE 6.3. Plot geodesics for various initial directions and starting points on 
the catenoid and on the hyperboloid of one sheet. 


EXERCISE 6.4. For Exercise 5.2, plot a geodesic with the same initial conditions on 
the unrolling cone for t = 0, t = 0.25, t = 0.5, t = 0.75 and t = 1. This illustrates 
that geodesics on the cone go to straight lines in the plane under the isometric 
unrolling. 
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EXERCISE 6.5. For Exercise 5.7, as the helicoid is isometrically bent into the 
catenoid, plot a geodesic with the same initial conditions for t = 7/2, t = 27/3, 
t = 57/6, and t= 7. 


Finally, for surfaces not in R®, but which are plane regions with conformal 
metric, we can still write MAPLE procedures to calculate the geodesic equa- 
tions. We again need the metric first. The inputs now are a parametrization 
X and a scaling function g. 


> EFGconf:=proc(X,g) 

local E,F,G,Y; 

Y := Jacf(X); 

E := dp(Y[1],¥[1])/g~ 2; 
F := dp(¥ [1], Y2))/e72: 
G := dp(Y[2],¥[2])/g~2; 
simplify ([E,F,G]); 


end: 


VVVVVVV 


Now the geodesic equations for the conformal metric are given by the following 

procedure. 

> geoeqconf:=proc(X,g) 

> local M,eq1,eq2; 

> M:=EFGconf(X,g); 

> eq1:=diff(u(t) t$2)+subs({u=u(t),v=v(t) }, 
diff(M([1],u) /(2*M[1]))*diff(u(t),t)~2 
tbl {um) v=) (OMI) ai) i) 
-subs({u=u(t),v=v(t) } diff(M[3],u) /(2*M[1]))*diff(v(t),t)~2 

> eq2:=diff(v(t),t$2)-subs({u=u(t),v=v(t) }, 
diff(M/[1],v) /(2*M[3]))*diff(u(t),t)~2 
+subs({u=u(t),v=v(t) } diff(M/[3],u) /(M[3]))*diff(u(t),t) *diff(v(t),t) 
+ subs({u=u(t),v=v(t) } ,diff(M[3},v) /(2*M[3}))*«diff(v(t),t)~2=0; 

> eql,eq2; 

> end: 


EXERCISE 6.6. Plot the geodesic on the Poincaré plane given in Figure 5.7. In 
other words, modify the plotgeo procedure to handle this situation. 


Chapter 6 


HOLONOMY 
AND THE 
GAUSS-BONNET THEOREM 


6.1 INTRODUCTION 


Euclid’s parallel postulate states that through any point off a given line 
there is a unique parallel line. The following exercise reminds us of how such 
a postulate may be used to derive essential geometric properties. 


EXERCISE 1.1. Use the parallel postulate to prove that the sum of the angles of 
a triangle is 7. Hint: draw a line through a vertex of the triangle parallel to the 
opposite side. 


At one time it was thought that the parallel postulate could be derived 
from other natural axioms of Euclidean geometry. It was only through the 
work of Lobachevsky, Gauss and Bolyai in the 1800’s that mathematicians 
realized just how misguided this attempt had been. Perhaps it shouldn’t be 
such a surprise that Gauss would see the necessity of actually postulating 
the parallel postulate. His investigations into the geometry of surfaces led 
not only to his Theorem Egregium concerning Gauss curvature (which, in 
fact, had previously been defined by O. Rodrigues), but to an understanding 
of various fundamental geometric quantities on surfaces. In particular, the 
“lines” of a surface may be considered to be its. geodesics and Gauss could see 
that geodesics did not always follow the dictates of the parallel postulate. 


EXERCISE 1.2. Discuss the parallel postulate’s validity for the geometries of the 
sphere and the hyperbolic plane where lines are geodesics. 


As the previous exercise demonstrates, the parallel postulate may fail in 
two ways. Because we have seen above that the parallel postulate may be used 
to calculate the sum of angles in a triangle, we might expect that its failure 
would lead to variability among such calculations. Indeed, we find that there 
are three possibilities for the sum of the angles of a triangle. 
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(1) Euclidean. In the Euclidean plane , the angles of a triangle have 
er oi = 7. 

(2) Hyperbolic (Gauss—Lobachevsky—Bolyai). In the hyperbolic 
plane, the angles of a triangle have en pi <7. 


(3) Elliptic (Riemann). In the sphere, the angles of a triangle have 
yg di Pe T. 


In this chapter, we will see how Gauss curvature affects even the most basic 
quantities of geometry such as angle sums of triangles. Further, we shall 
see that Gauss curvature makes its presence felt even in the most basic of 
geometric notions, the notion of when two vectors are parallel. 

Before we begin, we must recall how to integrate on a surface. Readers of 
Chapter 4 may skip the following redundant discussion. First, consider how 
we compute the surface area of a surface. For example, the surface area of an 
R-sphere in 47R? and the intuitive reasoning behind this is as follows. Let 
M be asurface and consider a patch x. To approximate the area of the patch 
we might find the area of each parallelogram 


FIGURE 6.1. An infinitesimal piece of area 


and then sum them up. Of course, as usual in mathematics, we would then 
take a limit to make our approximations approach the true area. The limit 
then gives a continuous sum, the integral. It is important to note that we are 
defining the integral here. For sufficiently complicated “surfaces” outside our 
restricted smooth definitions, this notion of integration may fail. The area 
of such a parallelogram may be found by Exercise 1.3.7. Namely, Area = 
|x. X Xy|. Hence, we have 


V1 UL 
Area of patch x = / / |Xy X Xy| dudv. 
Uo Ug 
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Example 1.1: The R-sphere. 
Recall that |x, x X,| = R?cosv and0 <u<2n,-$ <v< fF. 


us 


2n = 
Area of R-sphere = i : R? cosv dv du 


Note that, by Lagrange’s Identity (Exercise 1.3.4), |x, x x,| = VEG — F?. 


EXERCISE 1.3. Show that the surface area of the torus is 47rR. 


EXERCISE 1.4. In calculus, two of the usual integrals used to calculate surface area 
are: 


b ner 
(1) Surfaces of revolution: SA = Qn f(x) V1 + f!(x)? dz, 
a 


b pho ——— 
(2) Graphs of functions z = f(z,y): SA = j ds V1 + f2 + f2 dx dy. 
a Cc 


Show that these formulas may be derived from the definition of surface area given 
above. 


Now, how do we integrate a function over a surface? A function simply gives 
different weights to the points in a region and then “adds up” the weighted 
area. Hence, we simply multiply the function by |x, x x,| and integrate. 


[tt ff tlxex laude 


= [| ft VEGF dua. 


Definition 1.2. 


We shall be interested in the integral of a specific function — the Gaussian 
curvature K. The integral f u & is called the Total Gaussian Curvature of M 
(or of a patch x). 
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Examples 1.3. 
(1) The R-sphere. The Gauss curvature is the constant K = 1/R?, so 


1 
[ Kage | [om <mldudy 
1 


= Fa Area 
1 2 
=A4r. 


(2) The Pseudosphere: The Gauss curvature is constant here as well, 
K = —1/c?. Hence, 


1 
i Ka: Area = —-—~ - 2rc? = —2r. 
M 6 Cc 


EXERCISE 1.5. Find the surface area of the pseudosphere. 


EXERCISE 1.6. Find the total Gaussian curvature of the torus and of the catenoid. 


6.2 THE COVARIANT DERIVATIVE REVISITED 


In our initial approach to the geometry of surfaces in R*® we used the co- 
variant derivative to define the shape operator and the various basic curvature 
invariants followed from this. Later, we defined Gauss curvature for surfaces 
not in R® by taking a particular formula for the curvature and using it as a 
definition. In this way we circumvented defining a covariant derivative for such 
abstract surfaces. In this section, we will see that defining and understanding 
a notion of covariant derivative in a more abstract context is worthwhile for 
reasons beyond the definition of curvature alone. 

To begin, let’s take x to be a patch with (as usual) F = 0 and suppose 
there is a closed curve (assumed to be unit speed or, at least, constant speed) 
a(t) = x(a(t),d(t)). Let €& = xu/VE and & = x,/VG and note that 
Ej . Ej = 1] and Eo E2 =] (with Ey : Eo = 0). Thus, Ej and Eo, which we 
say are fields of vectors along the curve, form an orthonormal basis for the 
tangent space at any point x(u,v). For this reason the basis of vector fields 
{E1, Eg} is called a moving frame on the patch. (More generally, such a basis 
may be found even in the case of a nonorthogonal patch.) Further, €;, €2 and 
U form an orthonormal basis for R*. Now, just from this information and 
what we know about covariant derivatives, we will analyze how the vector 
fields €, and €2 change along a. 

First, we will consider the most concrete case of a surface in 3-space. This 
will allow us to see how the covariant derivatives of 3-space and the surface 
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itself relate to each other. Indeed, the latter is defined in terms of the former. 
Then, once we understand what the covariant derivative tells us in this situa- 
tion, we can eliminate 3-space from the discussion and abstract the qualities 
of the covariant derivative to other surfaces. 

We will use the notation V®’ for the covariant derivative (or Jacobian 
matrix) in R® and V for either the projection of V® to the tangent plane 
of the surface or simply for the covariant derivative of M considered without 
regard to 3-space. Let Z = (z!,z?,z%) be a vector field in R® defined by 
functions z*: R? > R. Recall that the R°-covariant derivative V®’ Z is simply 
a coordinatewise version of the directional derivative (see Chapter 2.2). Then, 
for M C R°, the projection of VEZ onto the tangent plane T,(M) is obtained 
by subtracting off its normal component, 


VyZ=VPZ-(VPZ-U)U. 


This is then the definition of the covariant derivative intrinsic to the surface 
itself. 


FIGURE 6.2. Definition of VyZ 


EXERCISE 2.1. Show that ve’ a’ = a". Hence, a is a geodesic if Vga’ = 0. Hint: 
Consider de as a function of the 3 variables a!, a? and a°. This allows the formula 


for the chain rule to apply to a’[f] = a fea) | where f = ae. Then see Chapter 2. 


Now we write VRE: = W11E, + W1E2 + 8,U and VE; E = W2E1 + Woo + 
sgU, where the w’s are functions of the curve a’s parameter t. Using the 
product rule coordinatewise and the definition of the R® covariant derivative, 
we compute 


O=a'[€,-&] =2VEE-€& = un. 
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In general, the formula a/{[V-W] = VE'V-W+V- VEW holds for vector fields 
V = (v!,v?, v3) and W = (w!,w?,w?). This may be seen by the following 
computation (which is reminiscent of the proof of Lemma 2.4.1): 


a'[V -W] =a’ b> viw'| 
= 3 a! [v}w* + va! [w*] 
= \\(VEV)'wi +0(V2, Ww)! 
=VEV-W+V-VEW 
Similarly, since 0 = a’ [E, » E2] = VRE -EgtEr- VE E>, we have Wo] = —w19. 


EXERCISE 2.2. Show s; = S(a’)-€, and sg = S(a’) + €2 where S(q’) is the shape 
operator applied to a’ and s2 is the U-coefficient of ve’ £9. 


Finally, we obtain 
VEE: = waEo + (S(a’) - E1)U 
VF E = —W91E1 + (S(a’) . E2)U. 


Therefore, remembering that the unadorned notation V refers to the projec- 
‘: 3 
tion of V’ onto the surface’s tangent planes, we have 


t VvEi =wai€g  Var€g = —war€1. 


This description of the covariant derivative V in terms of the coordinatewise 
directional derivatives in R?° (i.e. VR’) implies properties such as the following. 


EXERCISE 2.3. Show 

(1) Sum Rule. VfaragaZ = [Var Z+9VaZ. 

(2) Leibniz Rule. Va fZ = 2MUZ + fVyZ. 

(3) Commutation Rule. If x(u,v) is an orthogonal patch, then “mixed second 
partials are equal” 

Vx, Xv = Vx, Xu. 

(4) Compatibility Rule. For a tangent vector w, EF = Xu- Xu, F = 0 (say) and 

G = Xy* Xv, 


w[E] =2 VwxXu “Xu w(G] = 2 VwxXu + Xv. 


These are the types of properties which we can abstract from the situation of 
a surface M in R°. In particular, the last two rules generalize to give a useful 
covariant derivative associated to the so-called Riemannian connection. We 
shall need these later when we discuss the influence of Gaussian curvature on 
surfaces like the hyperbolic plane. 
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6.3 PARALLEL VECTOR FIELDS AND HOLONOMY 


Suppose now that Vo is a unit tangent vector at a(t). One of the first 
questions of classical differential geometry is the following. Along the curve 
a, what tangent vectors should be considered parallel to Vo? After all, the 
notion of vectors or lines being parallel is basic to geometry, so it should not be 
so surprising that the question of parallelism arises in differential geometry 
as well. Because the covariant derivative V/V tells us how the (tangent) 
vector field V changes along a, the condition V_"V = 0 precisely expresses 
the desired parallelism of all the vectors comprising V. In other words, a 
vanishing covariant derivative implies (as it should) that V changes only in 
the normal direction to the surface and so is unchanging from the viewpoint 
of residents of the surface. Therefore, we say that V is a parallel vector field 
along aif VaV =0. 


EXERCISE 3.1. Show that a parallel vector field has constant length. Hint: use the 
Leibniz rule. 


Of course we need to know that parallel vector fields actually exist. In fact, 
we can uniquely determine a parallel vector field along a curve from initial 
data alone. 


Theorem 3.1. Let M be a surface with covariant derivative V and suppose 
a: I > M is acurve on M. For any tangent vector Vo at a(0), there exists a 
parallel vector field V along a with Va(9) = Vo. 


Proof. Without loss of generality take Vo to have unit length. Because a 
parallel vector field must have constant length, in fact every vector in the 
field will have length one as well. Then we may write V = cos@€, + sin €2 
where @ is the angle from V to €,. The desired condition Vg-V = 0 then 
becomes (by product and chain rules) 


dé 
0 = ~sino & Ei +cos@Vq E1 + cos @ — €o+sin8 Va Eo. 


Using our previous calculations of the covariant derivatives of €; and €2 along 
a (formulas f), we obtain 


d 
0=—sin@ [on + Zl €; + cos [on + 4 E9. 


Since sin@ and cos@ cannot be zero simultaneously, we must have ae = —W21 


or 


6(t) => 6(0) = fon dt. 


This formula then defines 6 and, hence, the parallel vector field V. 0 


6.3 Parallel Vector Fields and Holonomy 195 


The construction of a parallel vector field V along a from an initial Vo allows us 
to speak of the parallel transport of Vo along a. That is, the parallel transport 
of Vo along a to a point a(t) is uniquely determined (and defined) to be Vay. 
The angle of rotation (which only depends on a and @(0)), — f wai dt, is called 
the holonomy along a. 


EXERCISE 3.2. Show that a parallel vector field V along a geodesic a makes a 
constant angle with a’. More generally, suppose V is a parallel vector field along a 
and W is a vector field along a of constant length. Show that W is parallel if and 
only if the angle between V and W is constant. Use this to show that the holonomy 
around a curve does not depend on the parallel vector field along the curve. 


EXERCISE 3.3. Prove the following formula for geodesic curvature (see Theorem 
5.4.1). 


dé 
Kg = ade + w1 


Here, 6 is the angle between a’ and €;. Hints: (1) write a’ = cos@€, + sin 6 Eq just 
as for V in Theorem 3.1 and carry out the proof of the theorem to get 


dt 


(2) Note that at, = Vga’ and compare the formula above to the geodesic cur- 
vature formula of Theorem 5.4.1. (3) Why must it be true that, up to sign, 
Tx U =-—sin@€, + cos@ £9? 


Vara’ = E ton [—sin 8 € + cos6 9]. 


EXERCISE 3.4. Let a: [a,b] — M be a unit speed curve on a surface. Take a frame 
along the curve, {T, V = T x U, U}, where U is the unit normal of M. Show that 
the natural (“Frenet”) equations for this frame are 


T’ =KkgV+kU 

V! =—-KkgT —7,U 

U! = -kT —7V. 
Here, k is the normal curvature of T on M, kg is the geodesic curvature of a and Tg 
is defined by t, = —U'-V = V’-U and is called the geodesic torsion of a. Recall 
that a curve a is a line of curvature of M if T is always an eigenvector of the shape 


operator of M. Using the natural equations above, show that Tg = 0 if and only if 
a is a line of curvature. 


EXERCISE 3.5. Let U- N = cos@, where N is the principal normal of the unit speed 
curve a. If r denotes the usual torsion of a, show that the geodesic torsion obeys 


dé 
nn ear) 5 


Show that, if a is a geodesic, then tg = —r. Also, show that the converse of this 
statement does not hold by showing that each term of the above equation vanishes 
for any latitude circle on the sphere. 
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Notice what the proof of Theorem 3.1 and Exercise 3.2 above say in the 
special case of a unit speed u-parameter curve. In this situation, a’ = €1, so 
we have that either the parameter curve is a geodesic, in which case 6 = 6(0), 
or the parameter curve is not a geodesic, in which case the initial vector Vo is 
rotated by the holonomy @ (from the viewpoint of 3-space) as it moves along 
the curve. It is now time for the prime example. 


Example 3.1. Consider the R-sphere with patch 
x(u,v) = (Rcosucosv, Rsinucosv, Rsinv). 


The parameter curve tangent vectors are x, = (—Rsin ucosv, Rcos ucos v, 0) 
and x, = (~Rcosusinv, —Rsinusinv, Rcos v), while the unit normal is given 
by U = (cosucosv,sinucosv,sinv). An orthonormal basis for the tangent 
plane is then 


€, = (—sinu, cos u, 0) Eo = (—cosusinv, — sin usin v, cos v) . 
Now take a latitude circle on the sphere at latitude vo, 
a(u) = (Rcosucos vp, Rsin ucos vp, Rsin vo) . 


Note that |a’| = Rcos vp is constant, so the covariant derivatives of €; and €2 
with respect to a’ truly describe the changes of €, and €2 along a. Further, 
since the parametrization of a is compatible with that of €; and &2 along a 
and a’ = x,,, the R°-covariant derivatives are simply derivatives with respect 
to u, 


VE El = (—cosu, —sinu, 0) 
Ve yer = (sinusinvp, —cosusin vp, 0). 
In order to find Vag €; = wo1€2, we must decompose VEE = w21&2 +5,U 


(and similarly for VE, E2 = —W21€, + s2U). From the formulas above, we 
obtain 


(—cos u, — sin u, 0) = (—w21 cos usin up, —w21 sin usin v9, W21 COS Up) 


+ (81 COS UCOS Up, $1 SIN UCOS Up, $1 SiN Up) 


Therefore, we have 
W21 = sin Ug 81 = —COS U0. 
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EXERCISE 3.6. For V2; 2 = —wo1 £1 + 82U, show that s2 = 0. 


VaEi = sin Vo E2 Var Eg = —sin Vo Ej. 


By the proof of the theorem above, the holonomy along a is given by 


an 20 
- | W1 du = -{ sin Up du 
0 0 


FIGURE 6.3. Holonomy around a latitude circle 


EXERCISE 3.7. Calculate the total Gaussian curvature over the portion of the 
R-sphere above latitude vg and compare your answer to the holonomy around the 
vo-latitude circle. Up to the addition of a multiple of 27, what do you notice? 


The calculation of holonomy above says that, from our outside viewpoint, 
parallel tangent vectors rotate as they move along a latitude circle. Of course, 
as the terminology “parallel” signifies, 2-dimensional residents of the sphere 
see the vectors as parallel — so, from their viewpoint, not rotating at all. 


EXERCISE 3.8. What happens at the Equator and why is the Equator special 
among the circles of latitude? 


EXERCISE 3.9. Give the cone z” = a? (x? + y*) a parametrization 


u COS 6) ; 


: Uv , A U 
x(uv) = (u sin ¢ cos ane: using sin ae! 
where ¢ is the vertex angle of the cone and sing = 1/V1+a?. Take a parallel 
a(v) = (uo sin ¢ cos ine uo sing sin rire uo cos ¢) with 0 < uv < 2msin¢g and, by 
imitating the example above, show that the holonomy around a is —27sin ¢. 
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EXERCISE 3.10. Explain the result of the previous exercise by setting a sphere inside 
the cone so that the latitude v = —¢ circle and the cone parallel a coincide. Show 
that the unit normals of the sphere and the cone (and hence the tangent planes) 
also coincide along the curve. Therefore, the covariant derivatives of the cone and 
sphere agree along the curve as well. Hint: for the cone with vertex at the origin 
and vertex angle ¢, the center and radius of the sphere should be (0,0, uo/ cos ¢) 
and ug sin ¢/ cos ¢ respectively. 


FIGURE 6.4. Cone on sphere along a latitude circle 


6.4 FOUCAULT’S PENDULUM 


Nature uses only the longest thread to weave her patterns, so each 
small piece of fabric reveals the organization of the entire tapestry. 


— Richard P. Feynman 


In 1851 Jean Foucault (1819— 1868) built a pendulum consisting of a heavy 
iron ball on a wire 200 feet long to demonstrate the rotation of the Earth. 
(Foucault was also responsible for developing means for measuring the speed 
of light in various media. In particular, he measured the speed of light in 
air and in water and showed that the speed varied inversely with the index 
of refraction (i.e. Snell’s Law).) Foucault observed that the swing-plane of 
the pendulum precessed, or rotated, as time went on, eventually returning 
to its original direction after a period of T = noe hours, where vo is the 
latitude where the Foucault pendulum resides. The usual explanation for 
this phenomenon of precession is in terms of rotating reference frames which 
produce a horizontal Coriolis “force” displacing the swing-plane (see [Sym] and 
[Arn]). Here we shall look at the Pendulum in the framework of geometric 
phases or holonomy (see [WS], [M] and [Op]). 
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In order to analyze the Foucault pendulum from the viewpoint of geometry, 
assume the Earth to be nonrotating and the pendulum to be situated at 
latitude vo. Instead of the Earth rotating to move the pendulum, we move the 
pendulum once around the latitude circle in 24 hours at constant speed on this 
stationary Earth. This is clearly equivalent to the standard situation in terms 
of the ultimate movement of the pendulum. The long cable of the pendulum 
and the slow progression around the latitude circle have two consequences 
(which are given by the usual physics arguments). 

First, the long cable provides a relatively small swing for the pendulum 
which is then approximately flat. Hence, we may consider each swing as 
a tangent vector to the sphere. By orienting these vectors consistently, we 
obtain a vector field of pendulum swing plane directions V. At each moment 
of time ¢ there is such a swing direction vector V(t) and all these vectors 
may be placed along the latitude circle a(u) by associating a given moment 
of time ¢ with the unique point describing the pendulum’s movement along 
a(u). Hence, we write V(u) for the swing plane vector field. 

Secondly, because we move around the latitude circle slowly, the consequent 
centripetal force on the pendulum is negligible. (In fact, it is approximately 
545 of the downward force mg.) This says that we may take the only force 
F felt by the pendulum to be gravity in the normal direction U. But such a 
normal force does not affect the vertical swing plane of the pendulum tangen- 
tially, so the swing plane appears unchanging to a 2-dimensional resident of 
the sphere. That is, projected to the tangent plane, 


aie) 


Vai V = proj ——— 
where the covariant derivative again reduces to the ordinary derivative due to 
our special parametrization. By our earlier discussion, we then have 


Proposition 4.1. The vector field V associated to the Foucault pendulum 
is parallel along a latitude circle. 


Of course, as we transport the Foucault pendulum once around the latitude 
circle a, holonomy rotates the parallel vector field V by —27 sin vp radians. In 
particular, the angular speed of this vector rotation is then w = 27 sin vo_rads | 
The equivalence of our geometric situation with the physical one then gives 


Theorem 4.2. The period of the Foucault pendulum’s precession is 


27 rads 24 
a hours. 
WwW S1N Up 


Of course, this is precisely the period obtained in physics. Here, however, 
the precession of the swing plane of the Foucault pendulum results from the 
holonomy along a induced by the curvature of the Earth. Further, since we 
view the whole pendulum apparatus as stationary relative to the Earth, what 
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can explain the observed precession of the swing plane? As Foucault argued, 
we must have 


Corollary 4.3. The Earth rotates along its latitude circles. 


EXERCISE 4.1. Suppose a Foucault pendulum is transported around a latitude 
circle on a torus. (You should still assume the only force is normal to the torus.) 
Compute the holonomy and explain whether this experiment alone can tell you 
whether we live on a sphere or torus. 


6.5 THE ANGLE EXcEss THEOREM 


Now let us return to geometry itself. Consider a unit speed simple closed 
curve @ parametrized by 0 < s < 27. Let B denote the region “inside” (@ 
and assume that B is simply connected. That is, we assume that @ may be 
continuously shrunk to a point in B. Of course we may use the same notation 
and formulas for G as for a above. Note, however, that we do not assume 
that our surface is in 3-space. We abstract the properties of the covariant 
derivative from our discussion above (see Exercise 2.8) and simply note that 
such a thing actually does exist. For example, we write Vg/E€1 = w21€2 and 6 
denotes the angle between @’ and €;. Because €; and &2 are orthonormal, we 
may identify w21 in two ways: first, 

VeE1 Eo = wai 
dé 
“9 ds 
by our formula for geodesic curvature (of 3). Secondly, the lefthandside may 


be computed using the definitions of €; and & as x,,/ VE and x, / VG respec- 
tively. Note that @’ = x,u’ + x,v’, so we have 


Si du Xu dv 
vee a=|(7e) G+ (Ze), a] 


7 ee du | XwVE ~%uzE7?Ey dv| 
agg gee 
— (| Xuu  Xufu | du Mig Melly | dv bo 
- (52-88) 2+ 5e-28] 8)-% 
Using our usual formulas for x, and xy, (see Chapter 3.4), we obtain 
Ey Gu 
a xy = er and Xyy°*Xy = es 
Hence, these calculations and the assumed orthogonality of x, and x, imply 
Xy Xv —E, du Gy, dv 


LY Hopes Ae 
war 8 Te VG WEG ds 2/EG ds 
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EXERCISE 5.1. Verify the formula above using the expressions for xXuu, and Xuv 
derived in Chapter 3 (and ignoring U components). 


EXERCISE 5.2. The general formula for w21 derived above allows the determination 
of holonomy in a variety of situations. For instance, consider the Poincaré plane 
P = {(ax,y) € R? | y > 0} with the patch x(u, v) = (u,v) and conformal metric, 
W10W2 = “ where w1, W2 € Tp(P) and p= (u,v). 

Show that the holonomy along the horizontal] line v = 1, from (0, 1) to (a, 1) is equal 
to —a. This means that a tangent vector at (0,1) is rotated by —a radians as it 
moves along v = 1 to (a,1). Hint: Show that, in general for P, w21 = 1/vo along a 
u-parameter curve with v = vg. 


For the following we must recall Green’s Theorem (see Chapter 4). Let C be 
a simple closed curve which may be continuously shrunken to a point in its 
interior. We say such a curve is shrinkable. The line integral ie Pdxr+Qdy 
may be computed as an area integral 


f pac+aa= | [(P- ) dx dy 


where R is the region inside C’. Now, by the computation of w2, above and 
the identification w21 = Kg — a we obtain 


dé Ey, du Gy, dv 


ds "9 29/EGds 2V/EGds’ 


We then integrate both sides with respect to s to get 
Ey, du G, dv 
[qe- [osde= | 2B -sae 
= / Ey du — se: dv 
g2VEG 2VEG 
0 Ey 
= du d 
se alba ( Fea) * 5 ( =)! al 


by Green’s Theorem. Then, by Theorem 3.4.1, we have 


= | [ KVEGdudv 


=|. 


by the definition of integration on a surface (or patch) and the fact that, since 


F=0, |x, X x,.| = VEG — F? = VEG. 
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Theorem 5.1. For a shrinkable simple closed curve 3, the holonomy around 
@ may be identified with the total Gaussian curvature evaluated on the region 
B inside 8. Further, the total change in 6 around (3 is given by 


9(2) — 6(0) = / ute i K. 


Corollary 5.2. If G is a geodesic, then 
6(2m) — 6(0) = | K. 
B 


Now, by our definition of # as the angle between (’ and €;, we see that, 
for a closed curve (, the total change in 6 must be an integral multiple of 27. 


EXERCISE 5.3. Explain this statement. 


Hence, we have 27- A = [x,ds+ f 3X, where A is an integer. Now, to see 
what A is, intuitively we might think of what happens as we shrink ( to a 
point. Since the shrinking process is continuous and the total turning is an 
integral multiple of 27, the total turning must remain constant throughout. 
Near a point, we may think of the curve as a tiny circle in the plane and €; as 
a fixed vector (e; = (1,0) say). In this case it is clear that the tangent vector 
rotates exactly once as it is transported around the circle; that is, the total 
turning is 27. By continuity of the process, we must, therefore, have A = 1. 
To make this argument precise requires a version of the Hopf Umlaufsatz (see 
[Hsi] for example). 

Finally, we come to the main goal enunciated at the start of this chapter — 
an understanding of a fundamental concept in traditional geometry from the 
differential point of view. Consider a triangle (Figure 6.5) which is made up of 
three curves in a surface M and which is shrinkable. We say the angles i; are 
interior angles and their supplements 7 —1, are exterior angles of the triangle. 
Suppose we move around the triangle in the direction shown and consider the 
turning of the tangent vector(s) to the curve(s). The same intuitive argument 
above shows that we obtain a total turning of 27. There are two contributions 
to turning: (1) the contribution obtained as above via the geodesic curvature 
formula, f %g(A)+ f, K and (2) the contributions obtained at the corners of 


the triangle, (7 — 11) + (7 — i2) + (m — is). Hence, we have 


an= fegld)+ f+ (min) + (wa) + (wf), 
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FIGURE 6.5. A triangle in M 


Theorem 5.3. If a triangle in a surface is shrinkable to a point, then 
3 
2a = [rld) +/ K+So(r- ip): 
A : 
j=1 


Moving the angle sum to the other side and multiplying by —1 produces 


= f gf) -| je at Cy ar ae er eer 
A 
=m — (ty +42 + i). 
Corollary 5.4. Ifa triangle in a surface is shrinkable to a point and is made 


up of geodesic segments, then the sum of the interior angles of the triangle 
differs from 7 by (+ or —) the total Gaussian curvature. 


Corollary 5.5. If the surface M has constant Gaussian curvature K, then 
m — (4; tig +43) = —K - Area(A). 


Examples 5.1. 
(1) The R-sphere. K = et so 


1 
k= / 
[¥= wle 


Area of A 
~ RR 
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Hence, }0i; —-7= Area Therefore, on a sphere, 


; Area A 
Da =7t+ R2 > 


which verifies the traditional “parallel postulate” approach. 


(2) The Hyperbolic (or Poincaré) Plane. 


EXERCISE 5.4. Show that the sum of the angles of a triangle in H or P is strictly 
less than 7. 


Thus, curvature affects even the most fundamental of geometric quantities, 
the sum of the angles of a triangle. In this sense, from the differential point 
of view, Gaussian curvature is the basic structural element of geometry. In 
the next section we shall consider a global version of the Theorem above — 
the famous Gauss-Bonnet Theorem. 


6.6 THE GAUSS-BONNET THEOREM 


It is a fact that every surface M may be triangulated. That is, the surface 
may be completely covered by shrinkable triangles in the surface which meet 
only along edges or at vertices. Furthermore, an orientation on a surface 
induces orientations on each of the triangles so that the edge orientations are 
opposite when considered in adjacent triangles (see Figure 6.6). 


FIGURE 6.6. Oriented triangles meeting along an edge 


Suppose M has a triangulation with V total vertices, E total edges and F 
total triangles. Theorem 5.3 applies to each triangle, so we can add up both 
sides of the formula over all triangles to get 


3 


g(A) + K+ m— tj) = ) 2m. 
eae ee 
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Now, M may have boundary curves (which we will assume are smooth) just 
as for a cylinder of finite height. We denote this boundary by 0M. In this 
case, we may split up the sets of vertices and edges according to whether they 
lie on the boundary or in the interior of M. Let V = V;+ Vg and E = E;+E£pz, 
where the subscripts J and B refer to the interior and boundary respectively. 
Then 


i. Kot | K+3nF —2nV; —7 Vp =20F 

aM M 

since the sum of the interior angles at a particular vertex in V7, for all triangles 
intersecting that vertex, is 27 and the sum of the interior angles at a particular 
vertex in Vg, for all triangles intersecting that vertex, is the angle of the 
tangent line to the boundary at that vertex, 7. Furthermore, each triangle 
has 3 edges surrounding it and two triangles share a common edge, except 


for edges in Eg which meet only one triangle. This leads to the equality 
3F = 2E;,+ Eg. Replacing 3F 7 by (2E; + Eg)z, we then have 


| xo + | K+2nk,+7 Eg -—2nV,-7VR = 20 F 
OM M 
| Kot f K+2nE,;+nEgp+7Ep—7Vp—2nrVi-—a7Vp = 20 F 
OM M 
since Vg = Eg. Then 
i Kot f K 4+ 2n(E, + Eg) — 20 (V7 + Ve) = 20 F 
aM M 
/ not | K+2rnBE-27nV=27F 
aM M 


ij kot | K=2n(V-E+F) 
OM M 


Therefore, if we define x(M) = V — E+ F to be the Euler Characteristic of 
M, then we have 


Gauss-Bonnet Theorem 6.1. If M is a compact oriented surface with 
boundary 0M made up of a finite number of smooth closed curves, then 


[ot fw = 2ex(a) 


Corollary 6.2. If M is a compact oriented surface without boundary, then 


[k= 2x(m. 
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Of Course, the one point we have neglected up to this point is that the 
quantity y(M) = V-E+F may not be characteristic of M, but may depend 
on which triangulation is chosen. That this is not the case is a result which 
goes back to Euler. In fact, the result is more general. A graph G consists 
of finite sets of vertices V = {v,} and edges E = {e,;} where e;; denotes the 
unique edge joining v; and v;. The graph is connected if there is an edge path 
joining any two vertices. We shall only consider connected graphs. A graph G 
is embedded in a surface M if there is a map f: G — M such that the images 
of edges never intersect except at vertex images. An embedding is a 2-cell 
embedding if the surface polygons (or faces) determined by the embedding are 
simply connected (and, hence, look like deformed 2-disks). 


EXERCISE 6.1. Embed a small graph in the plane and calculate V —- E+ F. Be 
sure to include the face at infinity, the face which surrounds the entire graph. Do 
the same for 2-cell embeddings on the sphere, torus and cylinder. Be sure your 
embeddings are 2-cell. 


Theorem 6.3 (Euler). Any embedding of a graph in the plane gives V — 
E+F=2. 


Proof. The proof is by induction on the number of edges in the graph. The 
reader can easily check that the result is true for graphs with 1 or 2 edges. 
Let G be a graph with EF edges. There are two cases. The graph G may be 
a tree; a graph enclosing no faces. Then F = V — 1 and F = 1 (the face at 
infinity),soV-E+F=V-(V—1)+1=2 as desired. If G is not a tree, 
then G has a cycle, a path of edges which form a closed curve in the plane. 
This cycle has an inside and outside by the Jordan curve Theorem, so there 
is an edge e which bounds two faces. Remove this edge from G. Note the new 
graph G is still connected since e was on a cycle and the number of faces has 
been reduced to F ~ 1. By the inductive hypothesis on G (since it has E — 1 
edges), V —-(£ —1)+(F -—1) = 2. But simplifying the lefthand side gives 
V — E+ F = 2 which applies to the original graph G. C 


EXERCISE 6.2. Using the proof above as a model, show that the Euler charac- 
teristics y = V — E+ F for the sphere, torus, disk and cylinder are, respectively, 
2, 0, 1 and 0. For the disk (i.e. circle union its interior), use the theorem for the 
plane and remove the face at infinity. Explain why the sphere and plane both give 
V —E+F = 2 for 2-cell embeddings. Hint: stereographic projection. 


If we disregard geometry and only consider shape up to continuous defor- 
mation (for example), then any compact oriented surface (without boundary) 
is known to assume the form of a sphere with handles. A sphere with g han- 
dles, Sg, is obtained by cutting 2g disks out of the sphere and gluing in g 
cylinders along the boundary circles. Of course, the sphere is simply So and 
the torus is 5S}. 
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EXERCISE 6.3. Show that any 2-cell embedding of a graph on Sg gives V- E+ 
F = 2-29. Hence, the Euler characteristic of a compact oriented surface without 
boundary is always of the form 2 — 2g. 


EXERCISE 6.4. We have seen in Chapter 5 that a torus may be given a metric so 
that the Gauss curvature K = 0 at every point. Show, using Gauss-Bonnét, that 
the torus can never be given a metric with K < 0 globally and K < 0 at some 
point. This indicates how a nongeometric (i.e. but deformation) invariant such as 
the Euler characteristic can constrain the geometry of a surface. Further, show that 
any surface with positive Gauss curvature is deformable to the sphere. In fact, such 
a surface must be “topologically equivalent” to the sphere. 


Consider what the Gauss-Bonnet Theorem is really saying. For instance, 
a sphere may be stretched and twisted (without ripping) to produce various 
geometrically distinct surfaces such as the “pretzel” of Figure 6.7. But, while 
the Gauss curvatures of these surfaces are clearly not constant, because these 
surfaces are deformed spheres, the Gauss-Bonnet Theorem says that the total 
Gauss curvatures are always constrained to be 47. Somehow, the deformation 
shape of the surface — which is reflected in the global invariant x — acts to 
determine a global geometric invariant, the total Gaussian curvature. The 
idea of constraining geometry by such deformation invariants has led to many 
major results in Mathematics. In this way, the Gauss-Bonnet Theorem is the 
ancestor of much of the spirit of modern differential geometry and algebraic 
topology. (See [Go] for an interesting perspective on Gauss-Bonnet and the 
Euler characteristic.) 


FIGURE 6.7. A pretzel 


There are other ways in which Gauss-Bonnet constrains the geometry of a 
surface. The following example indicates how geodesics may be affected. 


Example 6.1: The hyperboloid of one sheet. 
Consider M: x? + y? — z? = 1, the hyperboloid of one sheet. Recall that M 
is a surface of revolution parametrized by 


x(u,v) = (cosh u cosv, coshu sinv, sinhu). 


The central (v-parameter) circle a = x(0,v) = (cosv,sinv,0) is a (closed) 
geodesic since h(u) = coshu and h’(0) = sinh(0) = 0. In fact, using the 
Gauss-Bonnet Theorem, we can show that this circle is the only closed geodesic 
on M. To see this, suppose that @ is another closed geodesic on M. 
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There are two possibilities; either a and @ are disjoint or they intersect. 
Suppose they are disjoint. Then a and @ are the smooth boundary curves 
of a portion of the hyperboloid M which can be deformed to a cylinder and 
which, therefore, by the exercises above, has Euler characteristic zero. Now, 
as part of the hyperboloid, M has K < 0 at each point. Hence, Sak <0 
contradicts the Gauss-Bonnet formula tak = 2rx(M) = 0. (Here we have 
used the fact that a and @ are geodesics to obtain [.7,k, = 0.) 

Now suppose that a and @ intersect in (necessarily) two points making 
interior angles ¢;, both of which are less than 7. 


FIGURE 6.8. A possible closed geodesic? 


EXERCISE 6.5. Justify this last statement, ¢; < a. Hint: use the uniqueness of 
geodesics through a point with given initial tangent vector. 


The region R (see Figure 6.8) bounded by @ and @ with interior angles ¢; is 
simply connected and, so, is deformable to a disk. Indeed, we may consider 
the region to be a triangle with one vertex on @ (say) with angle 7 and the 
other two vertices to be the points of intersection with interior angles ¢j. 
Again, since R is part of the hyperboloid, { pK <0. The local Gauss-Bonnet 
Theorem then says 


[K+ (n=) + (nbn) + (= ba) = 2 
[ K= 6146, 
R 
[ K>o 
R 


which is a contradiction. Hence, there are no other closed geodesics. 


EXERCISE 6.6. Show that, on a surface with K < 0, no closed geodesic bounds a 
simply connected region (i.e. a region deformable to a disk). 


6.7 Geodesic Polar Coordinates 209 


6.7 GEODESIC POLAR COORDINATES 


Finally, let us look at an alternative framework for the angle-sum theorem 
above. Although we have mentioned geodesic polar coordinates previously 
in Chapter 5, we have not described their construction. We shall do this 
now and see how they allow a conceptually different approach to the angle- 
sum theorem as well as yet another example of the influence of curvature on 
global geometry. A full (and rigorous) discussion of geodesic polar coordinates 
is rather technical and we will not go into it here. A nice (and relatively 
elementary) discussion may be found in {O’N1] and much of what we do 
below mimics this. Many of the details below are left to the reader. 

Fix a point p € M and let x(u, v) denote a mapping from a portion of the 
plane obtained by using v as the angle of a unit tangent vector w at p from 
a fixed unit tangent vector e; and u as the “distance” along the unique unit 
speed geodesic a with a(0) = p, a’(0) = w. More specifically, let e; and e2 be 
perpendicular unit tangent vectors at p and write w = cosve;+sinve2. Then 
x(u,v) = Qw(u), where the subscript is to remind us that a is determined 
by w, which, in turn, is determined by the parameter v. Of course we must 
restrict u so that x(u,v) is defined, so x(u,v) maps onto a (generally small) 
neighborhood of p. The key result, which follows from the inverse function 
theorem and which we shall not prove, is 


Theorem 7.1. The mapping x(u,v) is a patch for u > 0. That is, x(u, v) 
is a smooth map with smooth inverse from an open set in the plane onto an 
open neighborhood of p € M and is one-to-one for u > 0. 


Note the exclusion of p from the parametrization. This exclusion follows 
for the same reason that ordinary polar coordinates exclude the origin in the 
plane. As we have mentioned, such a patch is called a geodesic polar coordinate 
patch. 


ZI 
KTS 


FIGURE 6.9. A geodesic polar patch 


210 6. Holonomy and the Gauss-Bonnet Theorem 


EXERCISE 7.1. Using the fact that straight lines are geodesics in the plane (with 
the usual Euclidean metric), show that geodesic polar coordinates about the origin 
in the plane are, in fact, ordinary polar coordinates. 


EXERCISE 7.2. Let e; = (1,0), e2 = (0,1) and switch the roles of u and v in 
the usual patch for the unit sphere. Show that this patch provides geodesic polar 
coordinates around the North pole N = x(m/2,v). Here, note that it isn’t necessary 
to begin the geodesic parametrization at t = 0. How much of the sphere does this 
patch cover? 


EXERCISE 7.3. The usual polar patch for the hyperbolic plane x(u, v) = (u cosv, 
u sin v), does not give unit speed radial lines through (0,0). Show that the arclength 
of these lines is given by s(u) = 2arctanh(u/2) and that a unit speed reparametriza- 
tion is then given by 


B(s) = (2 tanh = cos ug, 2 tanh = sin vo) 
for fixed vg. Remember to use the hyperbolic metric here! Now reparametrize the 
patch x and show that you get geodesic polar coordinates around zero. 
The key to using geodesic polar coordinates is the following 
Gauss’s Lemma 7.2. The patch x(u,v) has E =1, F =0 andG > 0. 


Proof. E = Xy +X, = 1 because u-parameter curves aw,(u) are unit speed. 
Hence, x,|E] = E, = 0. The product rule, however, gives 


V=f; 
= Vx, (xy Xi) 
= 2Vx,Xu° Xu 


= 2Vx,Xv ‘Xu 
by the commutation rule, Exercise 2.3. Hence, 
0 = 2Vx, Xv ‘Xu + 2Xy + Vx, Xu 
since u-parameter curves are geodesics with Vx, Xu = 0. So, 


0 = 2Vx,, (Xv * Xu) 
= 2 Fy. 
Therefore, F' only depends on v. But now consider the v-parameter curve 


x(uo,v) and note that, at u = 0, x(0,v) = p is constant. Therefore, at p, 
x,(0,v) = 0 for every v and (for fixed v) 


lim F = lim x,(uo,v) -Xv(uo,v) = 0. 
uo—0 uo—0 
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Since F doesn’t depend on u, F = 0 along this v =constant curve. But the 
same argument applies for every v, so we must have F' = 0 identically in the 
patch. Thus, the u-parameter curves are radial geodesics (from p) and the 
v-parameter curves are the orthogonal trajectories to these geodesics. Finally, 
because x is a patch, G = EG — F? > 0. 0 


Remark 7.1. Note that the proof above contains the fact that VG(0, v) = 
|x,(0,v)| = 0. We will use this below as an initial condition for a very special 
differential equation. 


EXERCISE 7.4. Use the argument of Chapter 5 to show that, for any point q in the 
geodesic polar patch, the radial geodesic from p (i.e. the u-parameter curve) gives 
the shortest arclength of any curve joining p to q. Further, from the argument, infer 
that no other curve gives this arclength. 


Since the point p (called the pole) is a trouble spot for the patch, we 
must be careful in making inferences about the geometry near it. In order to 
understand what happens there, as is our custom now, we first construct a 
frame from the geodesic polar patch, 


Ex = Xu és = 


The u-parameter curves are geodesics, so €; is parallel along them. The vector 
field €2, however, has constant length 1 and maintains a constant angle of 7/2 
with €,, so (by Exercise 3.2) is parallel along radial geodesics as well. That 
is, Vx, €2 = 0. Let 


E1(0) = cos vp €1 + sin vp €2 E2(0) = —sin vp €1 + COS Up €2 


where e;, €2 are fixed unit tangent vectors at p. Let a(u) = x(u,vo) and 
consider the following covariant derivative along this u-parameter curve, 


Vx Xu = Vx, (WG E2) 
= (VG)ubo+ VG Vx, €2 


But €2 is parallel along a, so Vx, €2 = 0. Thus, 


Vx, Xy = Vx,Xu 
= (VG)u £2 
In particular, as we approach p, 


Xuv(0, vo) = lim (VG)u(u, vo))E2(0). 
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We can also compute this quantity another way using x, = €;. We have 
E1(0) = cos vp €1 +8in vp €2 = Xx,(0, vo) and we may write x,,(0,v) = cosve; + 
sinv eg. Then we compute Vx, Xzx(0,v) = —sinve;+cos v eg and Vx, Xu(0, vo) 
= — sin vp e1 + COS Up €2 = E2(0). Therefore, 


lim (VG)u(u, v9) )E2(0) = E2(0) 


and 


lim (VG)u(u, vo)) = 1. 


Notice what this calculation says about the metric near p. We knew previ- 
ously, of course, that E = 1 and F = 0 everywhere on the patch, so the fact 
that G is close to u? near p says that the metric around p is close to being 
Euclidean (in polar coordinates)— at least to first order derivatives. This 
indicates also that curvature should be given by a second derivative of VG. 
This is verified in the important exercise below. 


EXERCISE 7.5. Show that K = —(1/VG)(VG)uu by writing out the chain-rule 
formula for (JV au and comparing the result to that obtained by plugging EF = 1 
into Theorem 3.4.1. A little algebra produces the Jacobi Equation along a radial 
geodesic, (VG)uu + K VG = 0. 


In order to understand the angle-excess theorem from the viewpoint of 
geodesic polar coordinates, we must deal with nonradial geodesics as well as 
radial ones to form a triangle. With this in mind, let 7 be a unit speed geodesic 
which intersects a radial geodesic a(u) = x(u,vo). We may write y(t) = 
x(u(t), v(t)) and note that 7/ = x, 4 +x, 4 with (du/dt)? + G(dv/dt)? = 1 
since 7 is unit speed. Also, the angle @ at which y and a intersect is found 
by taking the inner product (denoted o) of their tangent vectors, 


du 


cos¢ = X,07/ = ore 


Hence, we obtain G(dv/dt)? = 1—(du/dt)? = sin? ¢ and so sind = VG du/dt. 
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Now we differentiate cos ¢@ = xX, 07’ along 7¥ to get 


d 
—sing ° = VXy 07 +Xy 0 Vey’ 
= Viiky Ory 


— / 
= Vx, etx, Xu oY 


du 
os TY xu%u oy + 


dv 
= Ge Y xe %u ° 4 


dy 


dt Ve Xe0 y/ 


_ dv j 
=o (VG)u E207 


= (Z)(VG)u VE. 


Then, replacing sind with /G ay produces 


dd du 

dt ~(VG)u dt 
or, in differential form, 

dd = —(VG)y dv. 


The calculation above relates the angle of intersection of geodesics to the 
rate of change of /G (along a radial geodesic). We can, therefore, apply 
this to a triangle made up of two radial geodesic geodesics and a nonradial 
geodesic which intersects them both to give a different proof of the angle- 
excess theorem. 


EXERCISE 7.6. (Following {Cox]) Suppose a geodesic triangle in M is made up 
of two radial geodesics a(u) = x(u, vo), G(u) = x(u, v1) and a unit speed geodesic 
+(t) = x(u(t), v(t)) which intersects a and @ as shown in Figure 6.10. 
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B 


FIGURE 6.10. A geodesic triangle 


Prove that 
| K=A+B+4+C—n. 
A 


Hints: (1) Use geodesic polar coordinates. (2) Start with the formula for total Gauss 
curvature and justify the calculation below, 


[k=] [KVBG-Pduae 
= [ [ KVGdude 


=f [ -(/G un dude 
= [1- (Gude 


V4 C 
=| au+ | dd 
vo n—-B 


=A+B+C—-rn. 


Now, from the start, we constructed a geodesic polar coordinate patch only 
in a neighborhood of a point p € M. It may (and does) happen that, if we 
try to extend the patch beyond this neighborhood — even to its boundary — 
then we lose the qualities of a patch. Namely, we may lose one-to-oneness or 
we may find that there is a point x(uo, v) with G(uo, v) = 0 so that regularity 
of the patch fails. This is a situation we can analyze and through which, once 
again, we will see the powerful effect of curvature on geometry. 

Let x(u,v) be a geodesic polar patch on M with pole p. Say that p = 
x(uo,v) is conjugate to p along the (radial) geodesic determined by v if 
G(uo,v) = 0. Of course, we can take geodesic polar coordinates about any 
point p € M, so we can get away from taking a specific parametrization when 
we need to and simply consider conjugate points along arbitrary geodesics. 
The following theorem is the key to understanding the relationships among 
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geodesics, conjugate points and shortest arclengths. The same argument as 
in Exercise 7.6 above (as well as Chapter 5) essentially proves the first part. 
The second part is more technical and harder, but plausible from the discus- 
sion above. The third part is the Hopf-Rinow theorem which we mentioned 
in Chapter 5. (In the following, we say that a curve @ is close to a if the 
distance from a(t) to G(t) never exceeds a certain small fixed amount.) 


Theorem 7.3. 


(1) If a is a geodesic joining p€ M tog € M and there are no conjugate 
points to p along a between p and q, then a gives the shortest arclength 
of any curve which is close to a and which joins p and q. 

(2) If p = a(to) is conjugate to p = a(0) along the geodesic a, then 
a cannot give the shortest arclength of any curve (even close to a) 
joining p to any q = a(t) for all t; > to. 

(3) If M is geodesically complete, then any two points of M may be joined 
by a geodesic which has the shortest arclength of any curve between 
the two points. 


Example 7.2: The Sphere. 

The parametrization of the unit sphere obtained by switching the roles of u 
and v in the usual geographic parametrization gives G = cos*u. The North 
pole, u = 7/2, gives G = 0 and the next such zero occurs at the conjugate 
point u = —7/2, the South pole S. Of course a longitude (ie. u-parameter 
curve) is a geodesic which gives the shortest arclength of any curve joining 
N and S, but once we go past S on the great circle to a point q, then the 
geodesic giving the shortest arclength from N to q is the longitude on the 
other side of the sphere. 


FIGURE 6.11. A nonminimum-length geodesic 


Conjugate points may be determined from curvature in the following way. 
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We have seen in Exercise 7.5 that VG along a geodesic obeys the following 
Jacobi differential equation with initial conditions: 


(V@)uu te K VG = 0 
subject to 


VG(0,v) =0 
(VG),,(0,v) =1 


for all v. But the solution to such an equation is uniquely determined by 
the usual uniqueness theorems of differential equations theory. Therefore, in 
order to find a zero of G (i.e. a conjugate point of p) along a geodesic, it is 
sufficient to solve the differential equation abstractly and then find a zero of 
the abstract solution. Specifically, solve the differential equation (which we 
also call the Jacobi equation) 


f’+Kf=0 

subject to 
f(0) =0 
f'(0) =1 


for a function f and then find uo with f(uo) = 0. 


Example 7.3: The Sphere. 

The unit sphere, with the parametrization of Exercise 7.2 (and Example 7.2), 
has K = 1 and initial conditions f(7/2) = 0, f’(7/2) = 1. The general 
solution of f” + f = 0 is given by f(u) = A cosu+ B sinw and the initial 
conditions give B = 0 and A = —1. Then, f(u) = —cosu and the next zero 
for f after u = 1/2 is uy = —7/2 — the South pole. Note that, because radial 
geodesics from the North pole N are unit speed, the arclength to the South 
pole is 
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EXERCISE 7.7. 

(1) For a plane, where and when do geodesics give the shortest arclength? Find 
conjugate points by solving the Jacobi equation with K = 0. 

(2) For the hyperbolic plane, where and when do geodesics give the short- 
est arclength? Find conjugate points by solving the Jacobi equation with 
kK=-1. 

(3) Generalize the first two parts to the following: if a surface M has K < 0, 
then there are no conjugate points along any geodesic. Hence, any geodesic 
gives the shortest arclength for all curves near it. Hints: show that f(u) > u 
by using the initial conditions f(0) = 0, f’(0) = 1 and the Jacobi equation 
to derive the consequence f’(u) > 1. Hence, f(u) # 0, so no conjugate point 
can occur. 


Finally, we come to the main result of this section. In order to give a proof, 
we first need a basic comparison result from differential equations. (Also, we 
use the fact that, just as in the sphere example above, because geodesics are 
unit speed, the parameter u along the geodesic is precisely the arclength along 
the geodesic.) 


Theorem 7.4 (Sturm-Liouville). Let y’’(t) + g(t) y(t) = 0 be a second 
order linear differentia] equation with initial conditions y(0) = 0 and y’(0) # 0. 
Suppose, for all t, that a? < g(t) < b? and let to > O be the first positive 
solution to y(t) =0. Then 

Stos 


13 
e139 


We can apply the Sturm-Liouville comparison theorem to the Jacobi equa- 
tion in the situation where Gauss curvature is positive and bounded away 
from zero. (In order to ensure the proper domains for geodesics, we again 
refer to the notion of “completeness” mentioned in Chapter 5.) 


Lemma 7.5. Suppose M is complete and K > a? > 0. Then any geodesic 
a: [0,00) + M has a conjugate point somewhere in the interval (0, 7/a). 


Proof. We apply the Sturm-Liouville theorem to the Jacobi equation f” + 
K f = 0 to see that the first conjugate point (ie. first zero of f) occurs for 
Uo <7 / a. O 


We can globalize this local result by asking how big M itself can be under the 
curvature constraint K > a? > 0. From Theorem 7.3 above, if M is complete, 
then any two points p,q € M are joined by a shortest length geodesic. Thus, 
it makes sense to define the diameter of M to be the length of the longest 
shortest-length geodesic joining any two points of M. In fact, this length may 
not be attained, so we must take a supremum of such lengths. Formally, we 
define the distance of p from gq, d(p,q), to be the arclength of the shortest 
geodesic joining p and g. Then 


Diam(M) & sup{d(p, q)|p and q € M}. 
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Note that a finite diameter for M@ C R? implies that M is bounded. Recall that 
a closed bounded subset of R® is called compact. The notion of compactness 
is, in fact, more general, but if it is unfamiliar, then the reader should simply 
consider the R? situation. Lemma 7.5 then gives 


Theorem 7.6 (Bonnet). Suppose M is complete and K > a? > 0. Then 
Diam(M) < 1/a and, consequently, M is compact. 


Proof. Let p,q € M be arbitrary points. Since M is complete, there is a 
shortest arclength geodesic joining them. Lemma 7.5 implies that the first 
conjugate point occurs along this geodesic at up < 7/a. Since the geodesic is 
shortest length and we know this cannot be the case after the first conjugate 
point, then the arclength of the geodesic must be < 7/a. Because this is true 
for all p,q € M, then Diam(M) < 7/a. Hence, M is bounded and, because 
it is complete, it is therefore compact. [J 


Once again we see that hypotheses on Gauss curvature have powerful con- 
sequences. We saw in Chapter 3 that compactness implies a local result; 
namely, that the curvature is positive for at least one point of a surface. Now 
we have seen that the global hypothesis of positive curvature (bounded away 
from zero) throughout the surface implies compactness of the surface. The 
following exercises point out the limits of the hypotheses and conclusions. 


EXERCISE 7.8. The hyperboloid of two sheets x” + y? —z? = —1 and the paraboloid 
z= 2" +y? have nonnegative Gauss curvature and are complete since they are closed 
in R°. They are not bounded however, so they are not compact. What goes wrong 
here? 


EXERCISE 7.9. Is the converse of Bonnet’s theorem true. That is, if M is compact 
(say, closed and bounded in R*) with Diam(M) < 7/a, then is it true that K > 
a” > 0? Hint: estimate the diameter of a torus and compare its Gauss curvature. 


EXERCISE 7.10. In Example 7.3 and Exercise 7.7 above, the Jacobi equation f” + 
K f =0 was solved for K = 1, K = 0 and K = -1. Use these solutions to interpret 
how geodesics starting from the same point come together or grow apart. To do 
this, note that a solution (by uniqueness) must be VG for a particular geodesic 
polar patch and that VG = |xy| measures the instantaneous rate of change of a 
radial geodesic in the v-direction. In Chapter 5 we considered geodesics as paths of 
particles constrained to lie on surfaces under no external forces. The interpretation 
above shows that, in particular, equilibrium positions of such particles on surfaces 
of negative curvature exhibit exponential instability (see [Arn, Appendix 1 IJK]). 


Chapter 7 


MINIMAL SURFACES 
AND 
COMPLEX VARIABLES 


7.1 COMPLEX VARIABLES 


At the end of Chapter 4 we hinted that there was a close relationship 
between the subjects of minimal surfaces and complex analysis. In this chapter 
we describe this relationship and use it to “represent” minimal surfaces in 
a very convenient way. Basic references for this chapter are [Oss1], [Ni], 
[DHKW] and [FT]. We shall begin by recalling the basics of complex variable 
theory. 


We denote /—1 by i and the field of complex numbers by C = {z = 4+ 
iy|cz,y € R}. A function f: C — C is continuous at 2p if lim,_.., f(z) = f (zo). 
If this is true for all z) in some open set D, then f is continuous in D. The 
function f is complex differentiable at zo € C if 


jam £62) — f(%) 


220 z— £0 


exists. In this case, the limit is denoted f’(zo). If the limit exists for all z) € D, 
D open, then we say that f is analytic or holomorphic in D. Notice that, al- 
though this definition resembles the usual single variable calculus definition of 
derivative, it is much more subtle since z may approach Zo from any direction 
along any kind of path. We can use this subtlety to our advantage, however. 
Because the range of f is C, we can write f(z) = f(ztiy) = o(a, y)+iv(z, y) 
where ¢ and w are real-valued functions of the two real variables x and y. The 
function ¢ is the real part of f while w is f’s imaginary part. Now let’s assume 
the limit above exists and compute it in the special case of z = iy — 2 = iyo. 
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d(£0,y) + (20, y) — [6(20, yo) + 2¥(Zo0, yo)] 


to i(y — yo) 

— jm $20.9) — (0, Yo) + (Zo, ¥) — W(Zo, Yo) 
yo i(y — Yo) 

= jim Plt0¥) = P(t, 4), 1 (t0,¥) — (a0, Yo) 
yu Yo i(y — yo) i(y — yo) 

_ 106 , ay 

~ 4dy dy 

_ ov _ {06 

— Oy Oy 


EXERCISE 1.1. Show that, when z = x — zo = Zo, then the limit is ge ao ise, 


Of course, if f is complex differentiable at zo, then both of these limits are 
equal to f’(zo) and, hence, to each other. Therefore, we have 


06 db 99 __ day 


Ox «Oy Oy Ox’ 
These are the Cauchy-Riemann equations. In fact, the analysis above may be 
enhanced to show that f is holomorphic on D if and only if ge Be oe oe 
exist and are continuous on D and the Cauchy-Riemann equations hold. We 
note here, without proof, that if f is holomorphic, so are all of its derivatives 


ee eee 


EXERCISE 1.2. Show that f(z) = 2” is holomorphic and compute f’(z) from the 
limit directly and from the Cauchy-Riemann equations. 


EXERCISE 1.3. The complex conjugate of z = x+iy is 7 = x—ty. Show that 
f(z) = Z is not holomorphic. 


EXERCISE 1.4. The modulus of z = x+iy is |z| = x? + y? = V2z. If f(z) =utiv 
and f(z) = u—iv, then we write |f| = Vu? + v2 = / ff. Show that a holomorphic 


function f with |f| constant is itself constant. 


Suppose f is holomorphic. The Cauchy-Riemann equations give 


2h 8 0d0 0 d¥ 

0x2" Oy2 Ar dy Oy dr 
_ 8p Oy 
~ Ody Oydzr 
==0 
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since mixed partials are equal. Thus, ¢ and (similarly) ~ satisfy the Laplace 
equation An = 0 (where A = e + 5) and, so, are harmonic functions. 
Conversely, if @ is a twice differentiable harmonic function of x,y, then on 
some open set there is another harmonic function w such that f = 6+ iw 
is holomorphic. Harmonic functions ¢ and w which give such an f are said 
to be harmonic conjugates. From now on, for convenience, we shall use the 
notation ¢, for the partial derivative with respect to x etc.. 


EXERCISE 1.5. Let ¢@ = x? — y* and find its harmonic conjugate w as follows. 
From Cauchy-Riemann, ¢z = wy, so integrate ¢, with respect to y to get ~ up 
to a function of z alone. Now use ¢y = —wWz to determine this function. Are you 
surprised at your result? 


Integration of complex functions is accomplished through the usual line in- 
tegrals of vector calculus. Suppose f = ¢+i~ is continuous and 7(t): [a,b] - 
C is a curve. Then we define the integral of f along y to be 


[i ~ [ somo dt. 


EXERCISE 1.6. Show that Le a ie d(x, y) dx — W(x, y) dy + if, o(x, y)dy + 
w(x, y) dx where the integrals on the righthand side are real variable line integrals. 


EXERCISE 1.7. Suppose f is holomorphic with continuous derivative on and inside 
a closed curve y. Use Green’s theorem on each integral in Exercise 1.6 to show that 
f. " f = 0. This is a weak version of Cauchy’s theorem. Show that this implies that 
integrals of holomorphic functions only depend on the endpoints and not on the 
paths chosen over which to integrate. 


The most important thing for us to remember is that there is a Fundamen- 
tal Theorem of Calculus for complex integrals. Namely, if f is holomorphic, 
then 


i f=) -F(@). 


Therefore, many of the formulas from ordinary calculus carry over into com- 
plex analysis. 

With a view to the future, when we shall consider a patch x(u,v) with 
complex coordinates, we write z = u+iv, Z = u-— iv and introduce the 
following notation for complex partial differentiation: 


GLO 20) Od a, 9 
az 2\0u av dz. 2\du Ov) * 


One advantage of this notation is that it provides an easy test for f to be 
holomorphic. 
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EXERCISE 1.8. Show that f is holomorphic if and only if of =), 


EXERCISE 1.9. Show that 


Af a een oe =4(3 (2). 


7.2 ISOTHERMAL COORDINATES 


As we saw in Chapter 4, the key to introducing complex analysis into 
minimal surface theory is the existence of isothermal coordinates on a minimal 
surface. Recall that a patch x(u,v) is isothermal if E = x,y-Xy =Xy:Xy =G 
and F = 0. In fact, isothermal coordinates exist on all surfaces, but the 
proof is much harder than the one given below (due to Osserman [Oss1]) for 
minimal surfaces. 


Theorem 2.1. Isothermal coordinates exist on any minimal surface M C R°. 


Proof. Fix a point m € M. Choose a coordinate system for R? so that m is 
the origin, the tangent plane to M, T,,M, is the ry-plane, and near m, M 
is the graph of a function z = f(z,y). Furthermore, the quotient and chain 
rules give 


WwW WwW 


2 
(+2) - (24) =~ [fee(1+ 2) — Mfefufey + fw +2) 
y x 


WwW W 


1 2 
(<4) - (44) = — 2 [fee(1 + $2) — Befyfau + f+ 2) 
‘ y 


where w = ,/1+ f2+f?. As is traditional (and convenient), let p = fz, 
q = fy. Because M is minimal, f satisfies the minimal surface equation 


fx (1 + f?) = 2feby fey P Ffyy( = f2) 


so we have (i422), — (44), = 0 and Ga: — (44), = 0. Define two vector 
fields in the ry-plane by 


2 2 
Ve= (=, 22) ad. Wes (#2, ++) 
W W W W 


and apply Green’s Theorem to any closed curve C’ contained in a simply 
connected region R to obtain 


Lv-[L@).- (2), ea=0 
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bw-JL.C2)-@),e0-s 


Since the line integrals are zero for all closed curves ink, V and W must have 
potential functions (see [MT]). That is, there exist ~ and p with grad(u) = V 
and grad(p) = W. (Here we use “grad” to stand for the gradient to avoid 
confusion with the covariant derivative.) Considered coordinatewise, these 


equations imply pz = itp by = ™! and pz = py = a Define a 


Ww , 


mapping T: R — R? by 
T(z,y) = (c+ (x,y), y + p(z,y)). 


The Jacobian matrix of this mapping is then 


1+p? 
g(ry= |i tee by |_| tt ws 
Po lt+py Mm 14 tr 


(pews > 0. The Inverse 


and we calculate the determinant to be det J(T) = 
0), there is a smooth inverse 


Function Theorem then says that, near m = (0 
J(T 


function T~1(u, v) = (x,y) with I(T") = = J( me 


l 1+itc  — Be otse wt+i+¢ —pq 
det J(T) a a : (1+ w)? —pq wt+l1l+p? 


Of course, the last matrix is just 


ie Ly 
Yu «Yu 


by the definition of the Jacobian. We will put these calculations to use in 
showing that the following patch (in the uv coordinates described above) 


x(u,v) = (x(u,v), y(u,v), f(x(u,v), y(u, v))) 


is isothermal. First we calculate 
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and 


b= Ky, Xu 
= era a [(w+1+4°)? +p2q? + p?(w+1+97)’ 
— 2p?q?(w+1+q?)+p7q°] 
= isoF [(1 + w)?(1+ 9? + p”)] 


w 


cru 


EXERCISE 2.1. Show that x» = (athe tlt p (et) +4q (ttt )) and 


G=Xy:Xy =E and F =0. 


Hence, the patch x(u, v) is isothermal. O 


EXERCISE 2.2. If M: x(u,v) is a surface with isothermal parametrization, show 
that the formula for mean curvature reduces to H = ie Hence, for M minimal, 


l=—n. 


7.3 THE WEIERSTRASS-ENNEPER REPRESENTATIONS 


From now on, unless stated otherwise, we shall take M to be a minimal 
surface described by an isothermal patch x(u,v). We let z = u+ iv denote 
the corresponding complex coordinate and recall 2 = s(x - if). Since 


u= +2 andv= aie). we may write 
x( 252) = (2 (2,.2),8°G2)59 (22) 


We regard x'(z,Z) as a complex valued function which happens to take real 


(cL iene oe ay 
values and we have 3 = 5(z,, — 1z,,). Define 


def OX b.2 3 
wo) = Oz = (2752.2) 


Let’s examine ¢ a bit more closely. We shall use the following notation: (¢)? = 
(at)? + (x2)? + (23)? and |¢|? = |xz|? + |x2/? + |xz|?, where |2| = vu? + v? 


z . . a . . 
is the modulus of z. First, note that (xt)? = 4((x%,)* — (#4)? — 2ixiz}). 
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Therefore, 
i re oe ee 
(¢)* = Z do (ai)? — So(at)? - 21 Sci ct 
j=l j=l j=l 
1 2 2 2 
= 7 (Xu |“ — |xy|° — 2ixy - x,y) 
1 
= q(E- G — 2iF) 
= 0 


since x(u,v) is isothermal. By comparing real and imaginary parts, we see 
that the converse is true as well. Namely, if (¢)? = 0, then the patch must be 
isothermal. 


EXERCISE 3.1. Show that |¢|? = s #0. 


eee se = 2£(%) = = 4x = 0 since x is isothermal. Therefore, each 
¢' = & is holomorphic. Conversely, the same calculation shows that, if each 
¢’ is Holomarpine then each x’ is harmonic and, therefore, M minimal. All 
of these observations together give 


Theorem 3.1. Suppose M is a surface with patch x. Let ¢ = 5, and suppose 
(¢)? =0 (i.e. x is isothermal). Then M is minimal if and a if each ¢' is 
holomorphic. 


In case each ¢’ is holomorphic we say that ¢ is holomorphic. The result above 
says that any minimal surface may be described near each of its points by a 
triple of holomorphic functions ¢ = (¢', ¢*, ¢°) with (¢)? = 0. Indeed, in this 
case we may construct an isothermal patch for a minimal surface by taking 
Corollary 3.2. x'(z,Z) =c;+2Ref ¢' dz. 


Proof. Because z = u+iv, we may write dz = dut+idv. (This is “differential” 
shorthand for 3 = a + if.) Then 
¢' dz = ~[(xi, — ix),)(du + idv)] = 5 [tu du + x}, dv + i(x,, du — xi, du)), 


¢' dz = 


Nir NLR 


Io, 
[(z?, + ixt,)(du — idv)] = 5 (tu du + x, du —i(x;, dv — xj, du)]. 
We then have dz? = Bz. = dz + & oe z= = ¢'dz+ ¢' dz = 2Re¢dz and we can 
now integrate to me Pd O 


So, in a real sense, the problem of constructing minimal surfaces reduces 
to finding ¢ = (¢', ¢”, 63) with (¢)? = 0. A nice way of constructing such a ¢ 
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is to take a holomorphic function f and a meromorphic function g (with fg? 
holomorphic) and form 


¢ = f(t —g) ¢= =F +9) P=fog. 


EXERCISE 3.2. Show that this ¢ satisfies (¢)? =); 


EXERCISE 3.3. Show that g = gr2ay. 


A function g is meromorphic if all its singularities are poles. That is, around 
each singularity z there is a Laurent expansion (generalizing the Taylor ex- 
pansion) of the form g(z) = Gar foes wy + ore a; (z — 20) for 
some finite n with coefficients determined by g. For us, the most important 
examples of meromorphic functions are rational functions g(z) = aa for 


polynomials P, Q. Therefore, we obtain 


Theorem 3.3: Weierstrass-Enneper Representation I. If f is holo- 
morphic on a domain D, g is meromorphic on D and fg? is holomorphic on 
D, then a minimal surface is defined by x(z, Z) = (x1(z, 2), 2?(z,Z), 73(z, Z)), 
where 


a°(z,Z) = Re2] fgdz. 


Before we look at some examples, it is worthwhile noting another form of 
the Weierstrass-Enneper Representation. Suppose g is holomorphic and has 
an inverse function g~! in a domain D which is holomorphic as well. Then 
we can consider g as a new complex variable t = g with dr = g’dz. Define 
F(r) = f/g' and obtain F(r)dr = f dz. Therefore, if we replace g by 7 and 
f dz by F(r)dr, we get 


Theorem 3.4: Weierstrass-Enneper Representation II. For any holo- 
morphic function F(r), a minimal surface is defined by the parametrization 
x(z,Z) = (a! (z, Z), 2?(z, Z),23(z,Z)), where 

a (2,2) = Re [1 — 7*)F(r) dr 

x?(z,Z) = Re [i (1+7?)F(r) dr 


z°(z,Z) = Re 2/ TF (r) dr. 
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Note the corresponding 


p= (50 —1*)F(r), (1 + 77)F(r), F(r)) 
This representation tells us that any holomorphic function F(r) defines a 
minimal surface! Of course, we can’t expect every function to give complex 
integrals which may be evaluated to nice formulas. Nevertheless, we shall 
see that we may calculate much information about a minimal surface directly 
from its representions. In order to look at some of our standard minimal 
surfaces from the representation viewpoint, we need to recall some of the 
basic functions of complex analysis. For this, write z = u+7v and define 


e* =e"(cosu +i sinv) 
and 
log(z) = In Vu? + v? +7 arctan @ 


We haven’t been precise here about branches of the log, but this technicality 
will not concern us. Using the definition of e*, we may define 


; 1Z e . e% —e-2 

sin z = ——————_ sinh z = ————— 
21 2 

e2 +e—% ee +e72 

cos z = ————_ cosh z = ————— 
2 2 


One reason these definitions are chosen is that they extend the usual real 
functions of the same name. For example, if z = u, then the definition of 
e” gives sin z as the real function sinu. Similarly, for z = u, sinh z = sinhu. 
While these complex functions have exactly the same differentiation, integra- 
tion, sum and difference rules as their real cousins, it is often useful to expand 
the complex functions into their real and imaginary parts. To accomplish this, 
we replace z by u+iv and use the definition of e* above. ' 


EXERCISE 3.4. Derive the following formulas: 


(1) sinz =sinucoshv +i cosusinhv. 
(2) cosz=cosucoshv +2 sin usinh v. 
(3) sinh z = sinh ucosu +i cosh usin v. 
(4) cosh z = coshucosv+2 sinh usin v. 
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Example 3.1: The Catenoid. Let F(r) = ree Then, using the substitu- 
tion + = e* below, we obtain 


ot =Re f(1- 7) x yar o = Rei [(L+7%) 5 ar 
=Re [5-5 =Re [i] 545] ar 
= —Re e+ = —Rei 5 
—Re cute = Rei ** 
= —Re cosh z = Re isinhz 


= —2coshucosv. = —2cosh usin v. 


= U. 


We obtain (without the negative signs) a parametrization x(u,v) = 
(cosh u cos v, cosh usin v, u) which we recognize as a catenoid. 


EXERCISE 3.5. Let F(r) = air. Show that the representation associated to F(T) 
is a helicoid. 


EXERCISE 3.6. Show that the catenoid and helicoid, respectively, also have repre- 
sentations of the form (f,g) = (-S-, —e*) and (f,g) = (-i £, —e’”). 


EXERCISE 3.7. Show that the helicoid has another representation (f,g) = (-4, +). 
This representation seems simpler than the first two, but there is a small problem. 
Namely, the integrals of the representation are not path independent. They are then 
said to have real periods. Show that if we carry out the integration for x° (of the 
representation above) about the entire unit circle, we obtain a real period equal 
to 2m. To fix this ambiguity, we take a smaller domain C/{0 UR}. We therefore 
obtain one loop of the helicoid from this representation. Other loops may be similarly 
obtained and fitted together to create the whole surface. Thus, the simplicity of the 
representation (f,g) = (-4, +) hides the subtlety underlying complex integration. 


EXERCISE 3.8. Let F(7) = i(+ — as). Show that the associated representation is 


Catalan’s surface x(u, v) = (u— sinucoshv, 1—cosucoshv, 4sin § sinh 5). Hint: 


after integrating, replace 7 by e~ 2/2 and use the expansion of sin z. 
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EXERCISE 3.9. Let F(7) = 1 —- a. Show that the associated representation is 
Henneberg’s surface 


x(u, v) = (2sinh ucosu — : sinh 3u cos 3u, 


2sinh usin v + : sinh 3u sin 3v, 


2 cosh 2u cos 2v). 


Let v = § and show that you obtain Neil’s parabola (z — 2)3 = 9x7. Hint: for the 
first part, after integrating, replace 7 by e”, use the expansions of sin z and cosz 
and replace y by —y. 


EXERCISE 3.10. Let F(r) = 1 (or, equivalently, (f,g) = (1,2). Show that the 
Weierstrass-Enneper representation gives you Enneper’s surface. If (f,g) = (1, 2”), 
then an n‘-order Enneper surface is obtained. Calculate the x(u,v) for the one. 
order Enneper surface. Hint: MAPLE might be useful here. 


EXERCISE 3.11. Let (f,g) = (z’, or). Calculate the x(u, v) given by the Weierstrass- 
Enneper representation. This surface is called Richmond’s surface. 


EXERCISE 3.12. Let F(r) = ae Show that the Weierstrass-Enneper representa- 
tion gives Scherk’s surface z = In($>4). Hints: (1) for az‘, use partial fractions to get 
i ie dr = i log(1—ir)—i log(1+77), (2) since the log’s are multiplied by 7 and you 
want the real part, the definition of log(z) gives z' = arctan(;4>) — arctan( 775) 
(ie. Ll+ir = 1—vu+iu and 1-—i7 = 1+ -— iu), (3) the formula for the tan- 


gent of a difference of two angles then gives x1 = arctan(y—34a2y)) (4) simi- 
larly, 2? = arctan( 7758452) ) (5) also, 2? = Re(log(r? + 1) — log(r? — 1)) = 


2 2 2 2,,2 
xin (See). (6) by drawing appropriate right triangles, write cos x? 


and cos in terms of u and v and show 2° = In( S825), Note that the integrals 
have real periods so that, just as for the helicoid, we can take one piece at a time 
to build up the whole surface. 


While we have seen that the Weierstrass-Enneper representations give us 
our standard array of minimal surfaces, their true value becomes apparent 
when we realize that we can analyze many facets of minimal surfaces directly 
from the representating functions (f,g) and F(r). In particular, this applies 
even to surfaces whose Weierstrass-Enneper integrals may not be explicitly 
computable. As an example of this, we shall compute the Gaussian curva- 
ture K of a minimal surface in terms of F(r). First note that isothermal 
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parameters give 
1 0 E, O Gu 
#2 [BG va (a (a a) * 3a (7ea)) 
Of Ee Vo | Be 
E i E 
ras 0? 
= a. # ns) 


= at 


where A is the Laplace operator 2+ + 2. By Exercise 3.1, we know that 
E = 2\4|?, so we compute E for ¢ = ($(1—77)F(r), $(1 +7?) F(r), TF(r)). 
We have 


: | 
B= 2)\|5(1—7?)F(r)P +151 + 7)F(O)P + brFr)? 
1 
= slr [th +i + al + 4iel'] 


Now, 7? = u? — v? + 2iuv, so |r? — 1]? = (u? — v? — 1)? + 4u?v?. Similarly, 
|r? + 1]? = (u? — v? +.1)? + 4u2v? and 4|7|? = 4(u? + v2). Then 


1 
E= 5IFI’2 [(u? — v7)? +1 + 4u?v? + 2u? + 2v?] . 
= |F\? [u* + 2u2v? + v4 +1 + 2u? + 2v?] 
=|F? [1t+u? +07)’. 


EXERCISE 3.13. Show directly from the definition of ¢ in terms of the representation 
2 
(f,9) that B= |f/? [1+ |91?] 


Now, nE = In|F|? + 2In(1 + u? + v?) and it is an easy task to prove the 
following. 


EXERCISE 3.14. Show that A(2In(1 + u? + v”)) = ea 

We must now calculate A(in|F|?) = A(n FF) = A(nF'+InF). Previ- 
ously, we have seen that A = 40?/0z0z. Further, because F is holomorphic, 
F cannot be, so OF /Oz = 0 and, consequently, Oln F'/Oz = 0. We are left 
with A(In F) = 46? In F/0z0z = 40(F'/F)/0Z = 0 since F, F’ and, hence, 
F'/F are holomorphic. Thus, A(In|F'|?) = 0 and A(in £) = 8/(1+u? + v?)?. 
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Theorem 3.5. The Gaussian curvature of the minimal surface determined 
by the Weierstrass-Enneper representation II is 


aA 
Kee ee 
[FPR + u2 + v4 


Proof. From the calculations above, 


K= — 55 A(nz) 
: —8 
Sacer 
_ —4 
= Faw 


EXERCISE 3.15. Use the identification F = 4 to derive the formula 
—4|q'|? 


K= ed +92) in terms of the Weierstrass-Enneper representation I. 


EXERCISE 3.16. Explain the apparent discrepancy between the two formulas for 
Kk. That is, the first formula never allows K = 0 while the second has K = 0 at 
points where g’ = 0. Hint: what was our assumption about g which allowed the 
transformation from Weierstrass-Enneper I to Weierstrass-Enneper II. 


EXERCISE 3.17. Recall that a point is umbilic if the two principal curvatures at the 
point are equal. Show that umbilic points on a minimal surface with representation 
(f, 9) correspond to the zeros of g’. Hence, umbilic points on a minimal surface are 


flat. 


EXERCISE 3.18. A minimal surface described by (f,g) or F(r) has an associated 
family of minimal surfaces given by (respectively) (et f,g) or e” F(r). Two surfaces 
of the family described by to and ¢; are said to be adjoint if t; — tg = 5: Show 
that all the surfaces of an associated family are locally isometric. Hints: (1) since 
we have isothermal coordinates, it is enough to show that E remains the same no 
matter what ¢ is taken and (2) |e“| = 1 for all t. 


EXERCISE 3.19. The catenoid has a representation (f,g) = (—S5-, —e*). Write 
the x(u,v) for its associated family and its adjoint surface. 


EXERCISE 3.20. Find the adjoint surface to Henneberg’s surface with F(r) = 
—i(1— =). Set u = 0 and show that the resulting v-parameter curve is the astroid 
2/3 4 y2/3 — (8/3)2/3_ This astroid is a geodesic. 
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So far, we have treated the Weierstrass-Enneper representation more from 
an algebraic and analytic point of view than from a geometric one. But, after 
all, it is the geometry of minimal surfaces which interests us, so it is reasonable 
at this point to connect the discussion above with our usual differential geo- 
metric constructs. For this, recall that the Gauss map of a surface M: x(u, v) 
is a mapping from the surface to the unit sphere S?, denoted G: M — S? 
and given by G(p) = Up, where Up is the unit normal to M at p. In terms 
of the parametrization, we may write G(x(u,v)) = U(u,v) and, for a small 
piece of M, think of U(u,v) as a parametrization of the sphere S?. Recall 
also that the induced linear transformation of tangent planes is given, for the 
basis {X,,Xy}, by G.(xu) = Uy, = —S(x,) and G,(x,) = U, = —S(x,). 


Proposition 3.6. Let M: x(u,v) be a minimal surface parametrized by 
isothermal coordinates. Then the Gauss map of M is a conformal map. 


Proof. Recall from Chapter 5 that, in order to show G to be conformal, we 
need only show |G.(xx)| = p(u,v)|xu|, |Gs(xv)| = p(u, v)|x,| and G, (xu) - 
G,(Xy) = p?(u,v) Xu + Xy. Since isothermal coordinates have E = G and 
F = 0, we get 
l m m n 

G, (Xu) =U, = ~ Bx = EY ) G,.(X») =U, = ~ exe i= E 
where we have used the formulas for U, and U, developed in Chapter 5. 
Taking dot products gives 


xy 


WP = (P+ m7], Ul = % fm? +n? 


Uy Uy = Sl +n), 


But, by Exercise 2.2, 1 = —n, so we obtain 


1 
(Cul? = (1? + m?] =|U,|? and Uy-Uy =0. 
Since |x,| = VE = |x,| and x, -x, = 0, we see that the Gauss map G is 


conformal with conformality factor Vl? + m?/E. O 


EXERCISE 3.21. Show that the conformality factor V1? + m2/E is equal to / |K], 
where K is the Gaussian curvature. 


EXERCISE 3.22. Suppose M: x(u,v) is a surface whose Gauss map G: M — s? 
is conformal. Show that either M is (part of) a sphere or M is a minimal surface. 
For simplicity, assume that the patch x(u,v) is orthogonal (but not necessarily 
isothermal). 
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EXERCISE 3.23. Show that a nonplanar minimal surface M: a v) cannot have 
constant Gaussian curvature. Hint: (1) if so, then define M: y(u,v) = /|K]| | x(u, v) 
and show that M has constant Gauss curvature —1 as well as the same unit nor- 
mal as M. (2) Show the Gauss map of M is a local isometry. (3) Why is this a 
contradiction? 


Recall that stereographic projection from the North pole is denoted by St: 
S?/{N} — R? and defined by 


cosucosuv sinucosv ) 


St(cos ucos v, sin ucos v, sinv) = : ia 
1—sinv 1 —sinv 


We have seen previously in Chapter 5 that stereographic projection is confor- 
mal with conformality factor 1/(1 — sinv). In cartesian coordinates, stereo- 
graphic projection is simply St(z, y, z) = (x/(1—z), y/(1—z), 0). We may iden- 
tify R? with C and extend St to a one-to-one onto mapping St: S? + CU{oo} 
with the North pole mapping to oo. With these identifications, we have 


Theorem 3.7. Let M: x(u,v) bea minimal surface in isothermal coordinates 
with Weierstrass-Enneper representation (f,g). Then the Gauss map of M, 
G: M + CU {oo}, may be identified with the meromorphic function g. 


Proof. The proof is a long calculation wince a? together all of our complex 
analytic ingredients. Recall that ¢ = ox b= = 55 and 


¢" = 5f(l— 9?) = sf(1 +9?) ¢° = fg. 


We will describe the Gauss map in terms of ¢!, ¢? and ¢°. First, we write 


oe ki = ((Xu SSE. sae) x a) Sa = 23.22, aah — 


aig3, ai? — 2? 21). Let’s consider the first component (x, x x,)' = x23 — 
meee, We have 
aa? — 23x? = Im| (22 — ix?)(x> + ir?)] 
= Im[2(0x?/0z) - 2(0x*/2)| 
= 4Im(¢°¢"). 
Similarly, (x, x xy)? = 4Im(¢°¢") and (x, x xy)° = 4Im(¢'¢?). Hence, 
Xy X Xy = 41m(979*, $79", d'9") = 2¢ x 4), 


where the last equality follows from z — Z = 2Imz. Now, since x(u,v) is 
isothermal, |x, x Xy| = |[xu|-|x.| = |xu|? = E = 2|¢|?. Therefore, 
Xy X Xy (ox) _ ox¢@ 


U = OOOO FO SS 
IXu XX] 2/9? |ol? 
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We now compute the Gauss map G: M > CU {oo}. 
G(x(u, v)) = St(U(u, v)) 


oxd 
8 (Fe) 
= (ales oP °F) 
lol? 
(ee) 
||? — 2Im(¢14?)’ |b]? — 2Im(g'¢?)’ 7” 
The last equality follows since 
z  _ 2Im(¢7¢*) 1 
I-26 — Bm) 
_ 2Im(g6%) I? 
|9/? |o|? — 2 Im(¢'¢?) 
2Im(¢?¢") 


~ [dP = 2Im(9" 4?) | 
and similarly for y/(1 — z). Identifying (x,y) € R? with z + iy € C allows us 
le 21m(4268) + 2iIm(*9!) 
CO) Sg BNO) 
Now, let’s consider the numerator N of this fraction. 
N = 2Im(¢7¢°) + 2i Im(¢°¢") 
= F10°d — F99° + 19°54 iF 
= $°(g' +19") — H°(G' +74"). 
Also, 0 = (¢)? = (61)? + (6°)? + (6°)? = (¢' —1¢?)(g' +147) + (¢°)?, 80 


—(¢°)? 
¢d+i¢?= Big’ 
Then we have, 
- ee - ¢° 2 
N=8(P +i) +8 ee 

_ PIG -1 ge )(H* +247) +1677] 
¢| —i¢? 

= [|o"? + [6 /? + 19° |? + i(6°6" — ¢76")] 


ee op 


a aim [|¢|? es 2Im(¢"¢”)| 


7.4 Bjorling’s Problem 235 


Hence, the second factor of the numerator NV cancels the denominator of 
G(x(u, v)) and we end up with . 


¢? 
G(x(u, v)) = ate 
By Exercise 3.3, we know that g = ots as well, so we are done. 0 


Using the Weierstrass-Enneper representation II, we see that the Gauss map 
may be identified with the complex variable 7 as well. , 


EXERCISE 3.24. A point p = x(uo, vo) is a pole of the Gauss map G: M — CUoo 
if G(p) = oo. By the definition of stereographic projection, p is a pole of G if and 
only if U(p) = (0,0,1), the North pole. Verify this by showing that U(p) = (0,0, 1) 
if and only if the denominator of the following expression for G vanishes: 


21m(¢?43) + 21 Im($34") 
Seta a ataelaay 


EXERCISE 3.25. Prove Bernstein’s Theorem: If a minimal surface M: z = f(x,y) 
is defined on the whole zy-plane, then M is a plane. Hints: (1) In the proof of 
the existence of isothermal coordinates, if the parameter domain is the whole plane, 
then the potential functions , and p can be extended over the plane as well. The 
mapping T then becomes a diffeomorphism (i.e. a smooth one-to-one onto map with 
smooth inverse) between the ry- and uv-planes. Therefore, we may assume that M 
has parameter domain the whole uv-plane where u and v are isothermal coordinates. 
(2) The normals for M are contained in a hemisphere. Rotate the sphere to get them 
in the lower hemisphere. (3) Think of the wv-plane as the complex plane C and look 


at the composition C — M S 3? /{N} — C. Why is this map holomorphic? Think 
of g. (4) Liouville’s Theorem in complex analysis says that a complex function 
defined on the entire complex plane which is both bounded and holomorphic is 
constant. : 


7.4 BJORLING’S PROBLEM 


Often, we wish to create minimal surfaces having preassigned properties. 
For example, in Plateau’s problem, we desire a minimal surface which spans 
a given boundary curve. In this case, preassigning the values of the minimal 
surface on the boundary puts a considerable constraint on construction of 
the surface. There are other natural constraints, however, which are easier 
to handle, but no less interesting. Suppose we wish to construct a minimal 
surface which contains a prescribed path of a free particle constrained to move 
on it. Since free particles move along geodesics (see Chapter 8), we are asking 
for a minimal surface containing a given curve as a geodesic. Can we solve 
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this problem? The answer turns out to be yes. In fact, although we shall 
give the proof of the solution below for the geodesic problem, it is possible to 
formulate the problem and solution for any vector field along the given curve 
which is orthogonal to the curve (see Exercise 4.1 below). First of all, let’s be 
precise about the question. 


Bjorling’s Problem 4.1: (Special Case). Given a real analytic curve 
a: I — R® (with a'(t) = 0 only at isolated points), construct a minimal 
surface M: x(u,v) having u-parameter curve a(u) = x(u,0) such that the 
normal N of a is the unit normal of M, N(u) = U(u,0), forue I. 


As usual, we may assume a is parametrized by arclength when it is conve- 
nient. Recall that a curve in a surface is a geodesic if its acceleration is always 
parallel to the unit normal of the surface. Since the direction of the acceler- 
ation is precisely the direction of N, we see that the condition of Bjorling’s 
problem is that the given curve be a geodesic in the surface. To solve Bjérling’s 
problem, we must delve deeper into complex analysis then we have so far. For 
example, the curve a is required to be real analytic (i.e. have a Taylor series 
which converges on J) because we can convert such curves automatically to 
their complex counterparts by replacing the real variable in the Taylor series 
of a by a complex variable z. The resulting complex Taylor series defines a 
holomorphic complex curve a(z): D > C? on a domain D C C containing 
the interval I. We say that a(z) is a holomorphic extension of a(t). Similarly, 
N(t) may be extended to N(z). The existence of a Taylor series for holomor- 
phic functions has a powerful consequence which we shall use below. For a 
proof, see [MH] for example. 


Identity Theorem 4.1: (Principle of Analytic Continuation). If f 
and g are two holomorphic functions on a connected, open region D C C and 
f(z) = g(z) for some convergent sequence 21, 22,...,2n,-.. + Z in D, then 
f =g on all of D. 


This beautiful and surprising result will be the key to our solution of Bjérling’s 
problem. So, given a(t) and N(t) as above, let a(z) and N(z) be the respective 
holomorphic extensions on a simply connected domain D containing I. We 
then have 


Theorem 4.2: (Solution to Bjérling’s Problem). There is exactly one 
solution to Bjorling’s problem and it is given by 


x(u,v) = Re la ~ if N(w) x a’(w) dw} , 


where uo € I is fixed (and w is a complex dummy variable). 


Proof. We start by proving the solution is unique if it exists. Suppose x(u, v) 
is a solution to Bjorling’s problem. That is, x(u, v) is a minimal surface (with- 
out loss of generality, in isothermal coordinates) having x(w,0) = a(u) and 
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U(u,0) = N(u). Since x(u, v) is isothermal, each coordinate function z‘(u, v) 
is harmonic. Let y’(u,v) denote the corresponding harmonic conjugate (with 
y’(uo, 0) = 0 for definiteness) and note that x*(z) + iy’(z) is holomorphic. We 
can create a holomorphic curve 3: D — C? by defining, for z = u+iv € D, 


B(z) = x(u,v) +ty(u, v). 


Now, @’(z) = Xx, + ty, = X, —iX, by usual complex differentiation and 
the Cauchy-Riemann equations. But x(u,v) is in isothermal coordinates, so 
|x, XxXy| = |x,|-x,| = VE- VG = E and, hence, x, xx, = EU. This implies 
that x, =U x x, and we obtain 


B' =x, —iU x xy. 


If we restrict to (u,0) € I, then we have @’(u) = a'(u) —i N(u) x a’(u) since 
X,(u, 0) =a’(u) and U(u,0) = N(u). We integrate this equation in the usual 
real variables sense to get G(u) = a(u) —i i ie N(t) x a’(t) dt for all u € I. 
(Here we have also used the initial conditions y’(u,0) = 0 to determine the 
constant of integration.) But now we see that, on the whole interval J, G 
agrees with the holomorphic curve y(z) = a(z) —i f N(w) x a'(w) dw. The 
Identity Theorem then says that @ and y agree on all of D. Then, since the 
real part of 3 is x(u,v) by definition, we get the required form 


x(u,v) = Re ale) =j Nw) x a! (w) aw 


To prove that a solution to Bjorling’s problem exists, we define a holomor- 
phic curve by G(z) = la(z) =a) N(w) x a’(w) dw! on a domain D where 
the power series converge. Note that, since N(z) and a’(z) are real for z € I, 
Re B’(z) = a’(z) and Im #’(z) = —N(z) x a’(z). Let us denote 3’ by ¢ to 
match up with our earlier work. For z € J, the usual rules of vector calculus 
apply to give a’-(N xa’) =0 and |N xa’| = |N|-|a’| = Ja’|. Then, for z € I 
we have 


(¢)* = a'(z) -a!(z) — 2ia'(z) - (N(z) x a'(z)) 
— (N(z) x a’(z)) + (N(z) x a"(z)) 
= |a’/? -0-|N xa’ |? 
= 0, 
Again, the Identity Theorem implies that the holomorphic function (¢)? must 


be zero on all of D. But this is exactly the situation which led to the 
Weierstrass-Enneper representations. Hence, we know by Theorem 3.1, that 
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the real part of { ¢dw = @ is a minimal surface in isothermal coordinates. 
That is, we have a minimal surface defined by 


x(u,v) = Re late) -if N(w) x a’(w) dw] . 


We must still check the conditions of Bjorling. Since N(u) and a’(u) are real 
for u € I, x(u,0) = Re @(u) = a(u) and the first condition is satisfied. Also, 
for u € I, we may compute (’(u) two ways, from the definition of 6(z) and 
from the harmonic conjugate process used in the proof of uniqueness. We get 


a’(u) —i(N(u) x a’(u)) = B’(u) = xu(u, 0) — ix,y(u, 0). 


Equating real and imaginary parts gives x,(u,0) = a’(u) and x,(u,0) = 
N(u)xa’(u). But recall that isothermal coordinates entail x,(u,0) = U(u, 0)x 
x,,(u, 0) = U(u,0) x a’(u) and, consequently, N(u) = U(u,0). Thus, the sec- 
ond condition of Bjorling is satisfied and we are done. O 


EXERCISE 4.1. Show that the solution of Bjorling’s problem presented above carries 
through without change for any real-analytic unit vector field along a, N, with 
N(t) - a(t) =0. Recall that a vector field N along a: I > R? is simply a smooth 


map N: I > R® defined on the same domain as a. We think of the vectors N (t) as 
originating from the points in three-space belonging to the image of a. 


EXERCISE 4.2. For the situation where, as in the special case of Bjorling’s problem, 
N = N, the unit normal of a, show that the solution to Bjorling’s problem may be 
written 


x(u,v) = Re late + if Bla’ (w)| dw ; 


0 


where B is the binormal of a. Hint: N= BxT=Bx a 


ja’ |" 


We can use the full solution to Bjérling’s problem provided by Exercise 4.1 in 
the particular case where we choose the vector field —N(t). 


Corollary 4.3. Let x(u,v) be a solution to Bjorling’s problem with curve 
a(t) and vector field N(t). Then the solution x(u,v) to Bjorling’s problem 
with curve a(t) and vector field —N(t) is given by 


X(u,v) = Re ale rif N(w) x a'(w) dw 


= x(u, —v). 


Proof. Flip the domain D about the u-axis to get a domain D and define 
X(u, v) = x(u, —v). Clearly, this surface is minimal and U(u, v) = —U(u, —v). 
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Hence, x(u,v) solves Bjérling’s problem with vector field —N(t). By the 
uniqueness of such a solution, however, we also have 


x(u,v) = Re [late + if N(w) x a’(w) dw 
uo 
and we are done. OC 


Remark 4.2. For u € I (ie. v = 0), N(u), a’(u) and z = w are real, so 
%(u,0) = a(u) = x(u,0). Since D and D are open and contain J, they must 
overlap in an open set. The Identity Theorem applies to show that, in fact, 
x(u,v) and x(u,v) are the same surface on the overlap. This means that 
we have extended x(u,v) past its original domain D by piecing it together 
with X(u,v) in D. This is an example of why the Identity Theorem is also 
called The Principle of Analytic Continuation. This is also an example of 
the minimal surface analogue to Schwarz’s Reflection Principle in complex 
analysis. Minimal surfaces, in general, may possess symmetry about lines or 
planes under appropriate conditions. Indeed, we have 


Theorem 4.4: (The Schwarz Reflection Principle). 


(1) A minimal surface is symmetric about any straight line contained in 
the surface. 

(2) A minimal surface is symmetric about any plane which intersects the 
surface orthogonally. 


The notions of symmetry above are the usual ones with respect to lines and 
planes. Namely, line symmetry means that, given a point p and line / on the 
surface, there is a point q on the surface lying the same distance away from 
| as p and the line in R? joining p and q meets | at a right angle. Similarly, 
plane symmetry means that given a point p and a plane P meeting the surface 
at aright angle, there is a point qg on the surface lying the same distance from 
P as p and the line joining p and q meets P at a right angle. Theorem 4.4 
follows from Lemma 4.5 and Exercise 4.5, as well as the fact that surfaces 
may be rigidly moved in R? without changing their geometric character. 


Example 4.3. Consider Scherk’s surface z = * In(cos ay/cosax) on the 
square —7/2 < ax < 1/2, —1/2 < ay < 1/2. As we approach any of the 
vertices of the square, we approach an indeterminate form : which, due to the 
nature of 2-variable limits, can approach any real number. That is, extending 
Scherk’s surface to the vertices of the square requires erecting a vertical line 
over each vertex. But then we may apply symmetry about a line to continue 
Scherk’s surface to a diagonally adjacent square such as 1/2 < ax < 37/2, 
1/2 < ay < 3x/2. Continuing in this manner produces the entire Scherk’s 
surface. 
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EXERCISE 4.3. Show that Scherk’s surface z = 1 In(cos ay/cosax) on the square 
—7/2< ax < 1/2, —1/2 < ay < 1/2 intersects the zy-plane in the lines y = +2. 
Also, show that these lines are lines of symmetry for the surface. 


EXERCISE 4.4. Verify the first statement of Theorem 4.4 for a ruling (0,0, v9) + 
u (cos vg, Sin vg, 0) of the helicoid x(u, v) = (ucosv, usin v, v). 


Lemma 4.5. Suppose M: x(u,v) is a minimal surface and x(u,0) is con- 
tained in the xy-plane. If x(u,v) meets the xry-plane orthogonally along 
x(u,0), then z}(u,—v) = z1(u,v), 22(u,—-v) = 2?(u,v) and 23(u,—v) = 
—a3(u, v). 


Proof. Denote x(u,0) by a(u) and let the vector field N(u) along a be de- 
fined to be the unit normal of the minimal surface U(u,0). Since a is in the 
zy-plane, it may be written a(u) = (a'(u), a?(u), 0). Also, since the sur- 
face meets the xy-plane orthogonally, N(u) = U(u,0) = (N1(u), N2(u), 0). 
With a view to applying the solution to plseine s problem, we calculate 
N(u) x a’(u) = (0,0, N*(u)(a2)'(u) — N2(u)(a!)(u)). The Identity Theo- 
rem implies that the holomorphic extensions of these quantities must have 
exactly the same formulas, but with variable z instead of u. Now we may 
apply the uniqueness of the solution to Bjorling’s problem. Since the first 
and second coordinates of N(z) x a’(z) are zero, we have x!(u, v) = Rea! (z) 
and 2?(u, v) = Rea?(z). Using Corollary 4.3 with a and —N, however, we 
see that z'(u, —v) = Real(z) and x?(u, —v) = Rea?(z) as well. Hence, the 
first two relations hold. Since the third coordinate of a is zero, x3(u,v) = 
-if N x a’ dw. Again using Corollary 4.3 with a and —N, we see that 


z3(u,—v) = +4 [N x a! dw = —23(u, v). 0 


EXERCISE 4.5. Suppose M: x(u,v) is a minimal surface and x(u,0) lies on the 
z-axis. Show that z}(u,—v) = 2(u,v), ad, —v) = —z*(u,v) and 23(u,—v) = 
—2°(u,v). Hint: a(u) = (a1(u),0,0) and N(u) = (0, N?(u), N3(u)). 


EXERCISE 4.6. Suppose a is a plane curve with unit normal N. Show that the 
minimal surface which solves Bjorling’s problem for this data intersects the plane 
containing a orthogonally. Thus, a is a geodesic because we choose a’s normal N 
as our vector field and is a line of curvature by Exercise 5.1.6. 


EXERCISE 4.7. If a(u) = (G(u),0,y(u)) with unit normal vector field N(u), show 
that the solution to Bjorling’s problem is given by 


x(u,v) = (Rest, Im [V9 (0) +7(w) dw, Reo(2)).. 
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Example 4.4: Henneberg’s Surface. Recall that (one form of) Hen- 


neberg’s surface is given by x(u,v) = (a+(u,v),2?(u,v), 2°(u,v)), where 
zi(u,v) = —1 + cosh2ucos2v, z?(u,v) = sinhusinu — } sinh 3usin3v and 
z(u,v) = —sinhucosv + 3 sinh 3ucos3v. We have already seen that this 


rather complicated parametrization comes, algebraically, from choosing the 
right generating function Fr) in the Weierstrass-Enneper representation. 
Here we wish to show that a natural geometric condition on a minimal surface 
also gives rise to Henneberg’s surface. Let a(u) = (cosh 2u — 1,0,—sinhu + 
3 sinh 3u). This is a parametrization for Neil’s parabola 2x? = 92°. 


EXERCISE 4.8. Check this statement. Hint: Show the identities cosh2u = 1+ 
2 sinh? u and § sinh 3u — sinhu = 3 sinh® wu. 


We want a to be a geodesic in our minimal surface M, so we choose a’s unit 
normal N(u) as our vector field. Note that a also will be a line of curvature 
in M since it is a plane curve. Then, we have 


a‘ (u,v) = Re (cosh 2z — 1) 

e22 + e72% 
— 
e?% cos 2u + e~ 2% cos(—2v) 


2 
= —1+ cosh 2u cos 2v 


=Re (-1+ 


and, similarly, z?(u,v) = Re(—sinhz + 5 sinh 3z) = —sinhucosv + 
3 sinh 3ucos3v. For x*(u,v), we compute a’/(z) = (2sinh2z,0,—coshz + 


cosh 3z). By Exercise 4.7, we must calculate Im [ \/ 3’? (w) + /?(w) dw = 


Im / V 4 cosh? 2w — 4 + cosh? w — 2.cosh w cosh 3w + cosh? 3w dw. 


EXERCISE 4.9. Show that the integral above reduces to 
; ; 1 
[sinh 3w + sinhwdw = 3 cosh 3z + cosh z. 


The following identities may be helpful (but use of MAPLE is highly recommended): 
cosh? 2z = 4cosh* z — 4cosh? z + 1, cosh3z = 4 cosh? z — 3cosh z, cosh z cosh 3z 
= 4cosh* z — 3cosh? x cosh? 3z = 16 cosh® z — 24 cosh* z + 9 cosh? z and sinh 3z = 
3sinh z+ 4sinh® z. 
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Hence, Im f 4/0/?(w) + y’?(w) dw 


1 
=Im (3 cosh 3z + cosh z) 
i , 
=Im (3 cosh 3u cos 3u + ; sinh 3u sin 3v + cosh ucos v + 7sinh usin v) 


1 
= 3 sinh 3u sin 3v + sinh usin v. 


Thus, z?(u,v) = 3 sinh 3usin 3v + sinh usinv. 


EXERCISE 4.10. Take the strip in the domain of u and w defined by u = 2r—1+s, 
v= n(r— i) withO<r<1,—.1<s<.1. Graph the surface obtained by applying 
the Henneberg patch x(u, v) to this strip. What surface is this? What does this say 
about Henneberg’s surface? 


EXERCISE 4.11. Let a(u) = (1 —cosu,0,u —sinu) be a cycloid. Show that the 
minimal surface containing a as a geodesic (and a line of curvature) is Catalan’s 
surface 


x(u, v) = (1 — cos(w) cosh(v), 4sin ; sinh 7 u — sin(u) cosh(v)). 


EXERCISE 4.12. Give another proof of the fact that the only minimal surface of 
revolution is a catenoid. Hint: take, for convenience, M: x(u,v) = (h(u) cosy, u, 
h(u) sin v) and, first, show that if M is minimal (i.e. H = 0), there is some u = uo 
with h’(ug) = 0. Then apply the solution to Bjérling’s problem to the parallel circle 
about ug with vector field the unit normal of the circle. 


7.5 MINIMAL SURFACES WHICH ARE NOT AREA MINIMIZING 


It has long been known that minimal surfaces do not always minimize 
area. We have already seen in Exercise 4.3.4 that this may be so for certain 
catenoids. In order to understand what is really happening in such examples 
however, it would be very nice to have a general method for analyzing them. 
It is to this that we now turn. Our discussion is an expanded version of 
that in [R]. Again, we begin with a minimal surface M: x(u,v) bounded by 
a Jordan curve C and we take a variation y‘(u, v) = x(u,v) + tV(u, v) where 
V(u,v) = p(u,v) U(u,v) is a normal vector field on M of varying length p(u, v) 
with p(C) = 0. (Here we write p(C’) = 0 to mean that p vanishes on the curve 
in the uv-plane which is carried to C by x(u,v).) Note that we are taking only 
a very special type of variation here, so we can only expect to derive necessary 
conditions for area minimization. We calculate y!, = xu+tVu, y, =xy tt, 


7.5 Minimial Surfaces Which Are Not Area Minimizing 243 


and yx y5 =x, x Xv tt[xu x Vy + Va X Xv] + #°V, x Vy. In the following, 
we use the notation: 


WM = (xy X Xy) + (Xy X Vy + Vy, X Xp) 
WN = 2(x, X Xy)* (Vi, x Vy) + (Ku X Vu) (Xu X Va) 
+ 2(xu x Vy) > (Va X Xv) + (Vu X Xv) (Vu X Xv) 


S = V/|Xu X Xv|? + 200 + 25H + O(F8). 


where O(t*) denotes terms involving powers of ¢ greater than or equal to three. 
With this notation, we see that the surface area A(t) = [ f[ ly’, x y!|dudv is 


given by 
A(t) =| [ sdudv 


Now, we are assuming M is minimal, so we know A’(0) = 0. Further, we have 
Lemma 5.1. If M is minimal, then = 0. 
Proof. First note that 

== (x, X Xp) (Xy x VL) + (xy K xy) > (Vy, X xy) 


= (Xu -Xu)(Xv Vo) — (Xu Vo) (Xv + Xu) 
si (Xy . Vu) (Xv : Xy) _- (Xu Xy) (Xp : Vu) 


by Lagrange’s identity. Now, V, = p,U + pU, and V, = pyU + pU,y, so we 
have 


Xu Vy = Xu°(p.U + pUy) 


= px, -U, since x, -U =0 
= pXy + (—S(xu)) 
=—pl by definition of J. 


Similarly, x,-Vy = —pm, xy: V, = —pm and x,-V, = —pn. Plugging these 
quantities into \ gives 


H=-pEn+pFm—pGl+pFm 
= —p(En+ Gl — 2Fm) 
= 0 


since En + Gl —2F'm is the numerator of mean curvature and M is minimal. 
0 
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Hence, S = \/|Xu X Xv|? + 174% + O(t3) and, upon taking two derivatives of 
area, we have 


2 = (add 
A(t) = ff RO aude and an) = ff SE aude. 


Note that S’ = eo) by Lemma 5.1, so that 
Slip = [Ky x Xe| and’ S’|z25 = 0. 


Hence, we have the expression for A”’(0), 


ano= ff A dua 
Kg OO KG| 


Now let’s rewrite YH using the Lagrange identity and the notation e = U,-Uy, 
$202 Ue 6 =U y Ce: We have 


MOM = 2(x,, x Xp) «(Vu x Vo) + (Xu X Vo) - (Xu x Vy) 

+ 2(xXy x Vy) + (Vu X Xv) + (Vu X Xv) + (Vu X Xu) 

= 2[(xu + Vu)(Xv + Vv) — (Xu + Vo) (Xv * Vu) 

+ (Xu + Xu)(Vo Vo) — (Xu + Vo)(Vo + Xu) + 2[(Xu * Vu)(Vo * Xe) 

— (Ku + Xv)(Vo + Vu)] + (Va » Vu) (Xv Xv) — (Va Xv) (Xv » Va) 

= 2{(—pl)(—pn) — (—pm)(—pm)] + E(p%, + p?g) — p?m? 
+ 2[(—pl)(—pn) — F(pupy +p’ f)| + G(pi, + p7e) — p?m? 

= 4p?[In — m?] + p?[Eg + Ge — 2F f] + Ep? + Gp? — 2F pupy. 


Plugging this, together with |x, xx,| = VEG — F?, into A’(0) gives A’ (0) = 


i | [4o?[In — m?] + p?[Eg + Ge-2Ff] + Ep, + Gon — 2F eur] yy, 


VEG — F? 


This is a general expression for the second variation of a minimal surface. 
As in ordinary calculus, if t = 0 provides a minimum for the function A, 
then we must have A”(0) > 0. Of course, the expression above is still much 
too complicated to analyze, so now we use what we know about minimal 
surfaces to simplify it. In particular, we may first suppose that the patch 
x(u,v) is in isothermal parameters. Hence, E = G, F = 0, 1 = —n (since 
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m?)/E?, |U,| = |Uy| = V2 + m2/VE and e = U,-Uy = 


= g. Putting these expressions into A’ (0) gives 


ve 
A" (0) = Ife lar? —m?)+2p°E 
li 


1 
E [—2p7(1? + m?) + Epi, + Ep2| dudv 


2 2 
me + E(p2 + p2)| dudu 


2 
+ p2 + p2dudu 


one 
_— 2 


= [ [28 p?K + 0% + paudo, 


Further, suppose the isothermal coordinates come from a Weierstrass-Enneper 
representation II. We have calculated previously that 
—4 


k= |FP(1 + u2 + 02)4 and E= |Fi2(1 + u? + v2)?. 


Substituting these into A” (0) gives 


—4 
A" (0) = alia + u? + vy’)? FP +u2 +024 + p + pe du dv 


=/ loom oy 5 +p, +p, dudv. 


Although ‘we have not written it explicitly, the integration to calculate 
A" (0) is carried out over a region R in the uv-parameter plane. Note that 
the last expression for A’(0) does not depend on the Weierstrass-Enneper 
representation at all. It only depends on the region R and the choice of a 
function p on that region. Hence, if we can find a function p (with p(C’) = 0) 
defined on R such that A”(0) < 0, then the minimal surface M cannot have 
minimum area among surfaces spanning C’. This brings us to 


Theorem 5.2: (Schwarz). Let M be a minimal surface spanning a curve 
C. If the closed unit disk D = {(u,v)|u2 +v? < 1} is contained in the interior 
of R, then a function p exists for which A’(0) < 0. Hence, M does not have 
minimum area among surfaces spanning C’. 


Before we give Schwarz’s proof of this result, let’s interpret it geometrically. 
As soon as we described M in terms of the Weierstrass-Enneper representa- 
tion II, we identified the parameters u and v with the real and imaginary parts 
of the complex variable rt. Of course, 7 is itself identified with the function 
g of the Weierstrass-Enneper representation I which, in turn, we have seen is 
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the Gauss map followed by stereographic projection. Therefore, since stereo- 
graphic projection from the North pole projects the lower hemisphere of S$? 
onto the unit disk D, R contains D in its interior precisely when the image 
of the Gauss map of M contains the lower hemisphere of S? in its interior. 
Of course, there is nothing special about stereographic projection from the 
North pole, so in fact we have the following geometrical version of Schwarz’s 
Theorem. 


Theorem 5.3. Let M be a minimal surface spanning a curve C. If the image 
of the Gauss map of M contains a hemisphere of S? in its interior, then M 
does not have minimum area among surfaces spanning C. 


Proof of Theorem 5.2. Let D = {(u,v,r) | u? + v? < r?} be the domain 
bounded by the cone r = Vu? + v?. Define a function on D by 


u? +? — r? 


p(u, v,7) = ute +r? 


and consider 


def —8p? 
—_—.—.~ + p? + p* dudu 
hal A 1+u? + v2)? u v 


where D(r) = {(u,v)|u? + v? < r?} is the open r-disk. Of course, this is 
A’(0) when we are in R, so we wish to show that the choice of p above leads 
to A(r) < 0 for certain values of r. First, let’s split the integral into two 
pieces and look at the integral of the last two terms of the integrand. If we 
let P = —ppy, Q = pp, and apply Green’s Theorem, we obtain 


/ “ppv du + pp, dv = | | pu, + py dudv 
u?+u2=r2 D(r) 
+ at P(Puu =P Pov) du dv. 
D(r) 
2 


Now, the left hand side is zero because p(u, v,r) = 0 whenever u? + v? = r?, 


so 
II i+ dua =—- ff p Apdudv. 
D(r) D(r) 


where Ap = Puy + Pov is the Laplacian of p. Hence, 


p 
Ap| dudv. 
oa =f ba ol ae G+u+u%e | — 


EXERCISE 5.1. Show by direct calculation, for p = p(u, v,1) = uted, that 


8p" 
Gewtep tae =0 
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By the exercise, we see that A(1) = 0. Now, r = 1 corresponds to the unit 
disk, so we would like A(r) < 0 for r’s slightly larger than 1. To show this 
is true, we need to look at A’(1). For convenience, let us change variables by 
taking s = = and t = ¢ with du=rds, dv =rdt. Then 


8p? 2 
=> p Geer. ee 


292 4 242 — r2 gs? +t? —1 def 


hg Ee ard), 
p(s, t,7) retredtr2 gt+P4+1 p(s, t) 


Therefore, in st-coordinates, p does not depend on r. Also, pu = Ps/T, Puu = 
Pss/ r? by the chain rule and similarly for v. Hence, Ay. = Ast p/r? and we 


notice that A, + does not depend on r. Now we can replace the uv-Laplace 
operator in the integral above to get 


--f fo Ne pen Sd A sa 


The fact that p and A,; are independent of r allows us to easily take the 
derivative of A(r) with respect to r. We obtain A’(r) = 


= ff SALLE AYP PEE HVE aa 
(1 + r2(s? + t2))4 


At r =1, s*+t? <1, so replacing p by its definition in terms of s and t gives 


2\ 2 
A'(1) = — f [See TE as dt 


(+s? +2)3 


1 2(1 2 t2 — 252 — t? 
--[f sla = 2t") isdt 
2442<1 (1 + s? + ¢?) 


2 2_ 4)2(1 — 2 +2 
-{{ 16(s“ +t sa t") as dt 
s24+t2<1 (1+ s? +t?) 
2 2_ 2. 
--ff 16(s? +t 12 (s? +t 1) asdt 
s24+t2<1 (fe s? + i*)° 


t? — 3 
=16 | | ie ac a 
o24t2<1 (1 + 8? + t?) 
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The numerator of the integrand is always negative since s? + t? < 1, so 
A’(1) < 0. Since this means that A(r) is decreasing at r = 1 and we have 
seen previously that A(1) = 0, it must be the case that A(r) < Oforl<r<7 
(some 7). 

Now suppose that the unit disk D is contained in the interior of the param- 
eter domain R. Then there is an r such that 1 <r < 7 and {(u,v)|u? + v? < 
r*?} CR. Define 


p(u,v,r) for u2 +? <r? 
plu, v)= 2 2 2 

0 for u“ + u* > r*. 
Note that plaz = 0 and, by the discussion above, A’’(0) < 0. Hence, the 
minimal surface given by the Weierstrass-Enneper representation which spans 
C is not area minimizing. C 


Remark 5.1. To be perfectly rigorous, we should note that the partial deriva- 
tives py, Py are not continuous on the boundary circle {(u, v)|u? + v2 = r7}, 
but may be “rounded off” suitably there while keeping A’ (0) < 0. 


We have seen in Exercise 2.1.9 that Enneper’s surface x(u, v) = (u—u3/3+ 
uv’, —v+v3/3~vu?, u?—v*) has no self-intersections for u2+v? < 3. Further, 
Exercise 2.3.7 showed that the Gauss map of Enneper’s surface (restricted to 
the disk u? + v2 < 3) covers more than a hemisphere of S?. By our discussion 
above, we infer that Enneper’s surface does not minimize area among all 
surfaces spanning the curve C given by applying the patch x to the parameter 
circle u?-+v? = R?, where 1 < R < 3. Of course, by the theorem of Douglas 
and Rad6é, there exists a least area (and hence minimal) surface spanning C. 
Therefore, there are at least two minimal surfaces spanning C’. In fact, the 
question of uniqueness for the solution to Plateau’s problem is a delicate one. 
Here is a positive result along this line which we mention without proof. 


Theorem 5.4: (Ruchert). For 0 < r < 1, Enneper’s surface is the unique 
solution to Plateau’s problem for the curve given by applying the Enneper 
patch to a parameter circle of radius r. 


EXERCISE 5.2. Show explicitly that Enneper’s surface is not least area among 
surfaces spanning C' (as above). Hints: (1) put |xy x Xy| = (1+u? +v*)? into polar 
coordinates with u = rcos@, v = rsin@, (2) compute the surface area of Enneper’s 


surface inside C 
2x pk 
Area = / [ (r? + 1)°r dr dO 
0 0 


where the r-factor comes from the Jacobian determinant. To get an explicit number, 
take R = 1.5 say. (3) In polar coordinates, the curve C’ has the form C’: (Rcos@ — 
3R° cos 36, —Rsiné@ — 4B sin 30, R* cos 26). Define a (nonminimal) cylinder with 
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directrix C' by 


y(s,6) = (Rcos6é — 5R° cos 36, s — Rsind — aR sin 36, R? cos 20) 


\ sectecanemenememen” SS a ed 
x() y(9) z() 


where 0 <0<7and0<s< 2|—Rsind— 4R3 sin 36|. Numerically (e.g. MAPLE) 
compute the surface area of this cylinder inside C' (with R = 1.5) 


415 
Knees [ aly(6)|/2'(8)2 + z/(0)2 dé. 
0 
(4) Compare with Enneper’s surface. 


Although this exercise and the result of Douglas and Rado show that there 
are at least two minimal surfaces spanning the curve C, in fact we only know 
one such surface explicitly — Enneper’s surface. The Douglas-Rad6 theorem 
is an existence theorem without an explicit formula for the least area surface 
spanning a curve. Indeed, at present there seems to be no example of two or 
more explicit minimal surfaces spanning a given Jordan curve. Of course, for 
non-Jordan curves we have the catenoid-disks example. 


7.6 MINIMAL SURFACES AND MAPLE 


MAPLE may be used to plot minimal surfaces, of course, using the “plot3d” 
command, but there are more uses which I will discuss here. First, the 
Weierstrass-Enneper representation may be put into a procedure which has 
input a holomorphic function (in some domain) and output an isothermal pa- 
rametrization of a minimal surface. In fact, I will give two procedures, one in 
rectangular coordinates and the other in polar coordinates. Sometimes one is 
more convenient than the other. 


> with(plots): 


> Weier := proc(F) 

> local Z1,Z2,X1,X2,X3,Z3,X; 

Z1 := int(F*(1-z~2),z); 

Z2 := int(IxF*(1+z~2),z); 

Z3 := int(2*F *z,z); 

X1 :=evalce(Re(subs(z = u+I«*v,Z1))); 
X2 := evalc(Re(subs(z = u+I*v,Z2))); 
X3 := evalc(Re(subs(z = u+I*v,Z3))); 
X := (X1,X2,X3}; 

end: 


VVVVVVVV 
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> Weierpol := proc(F) 


> local Z1,Z2,X1,X2,X3,Z3,X: 

> Z1 := int(F*(1-z~2),z); 

> Z2 := int(I*F*(1+z72),z); 

> Z3 := int(2*F«z,z); 

> X1 := evalc(Re(subs(z = exp(u)*(cos(v)+I«sin(v)),Z1))); 
> X2 := evalc(Re(subs(z = exp(u)*(cos(v)+I*sin(v)),Z2))); 
> X3 := evalc(Re(subs(z = exp(u)*(cos(v)+I*sin(v)),Z3))); 
> X := [X1,X2,X3]; 

> end: 


These procedures are reasonably self-explanatory. The “int” command 
takes an integral with respect to the variable (i.e. z here) listed at the end. 
In Weierpol, the “subs” command is used to plug the polar representation 
z = u(cosu +isinv) into each of the computed integrals, while “evalc” and 
“Re” take the real part of the complex expression. For example, 


> catenoid:=Weier(1/(2*z~2)); 


1 Z 241 1 
catenoid := oe —=v Bt in( Va Fv) 


Now, this isn’t a particularly pleasing form, so we can substitute for u and v 
as follows. 


> simplify(subs({u=exp(r)*cos(s),v=exp(r)*sin(s) },catenoid)); 
: (=r) (¢(2P) 1 in(s)e(-")(e2") 
5 cos(s)eS~" (e“” + 1), 5 sin(s)e\"(e“"? +1), r 


Of course, this is a usual catenoid parametrization when the multiplication 
of exponentials is carried out. This simple illustrates the main difficulty with 
calculating the Weierstrass representation in MAPLE. Namely, even if the 
integrals are able to be given in closed form, their exact form may not be one 
which is immediately recognizable as a standard surface. Of course this is 
true when the representation is worked out by hand as well. This is but one 
more instance of the general rule that, while computer algebra systems can 
do amazing and beautiful things, they are not a panacea. 


EXERCISE 6.1. Carry out the following MAPLE commands related to the helicoid. 
> helicoid:=Weier(I/(2*z~ 2)); 

> simplify(subs({u=exp(r)*cos(s),v=exp(r)*sin(s)},helicoid)); 

> hel:=simplify(Weierpol(I/(2*z~ 2),trig); 

> plot3d(hel,u=-1.2..1.2,v=0..2*Pi,scaling=constrained,grid=[10,30], shading=z); 
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EXERCISE 6.2. Carry out the following, simplify if necessary and plot the resulting 
surfaces. 


> enneper:=Weier(1); 
> ennepol:=Weierpol(1); 
> trinoid:=simplify(Weierpol(1/(2*z*(z~ 3-1)~2))); 


EXERCISE 6.3. For Henneberg’s surface, write 
> Henne:=Weierpol(1-1/z~ 4); 


Show that the expressions you obtain may be simplified to the usual parametrization 
for Henneberg’s surface. Hint: the identity cos?(v) = 7 cos(3v) + 3 cos(v) and its 
sin analogue may be useful here. Now plot Henneberg’s surface by 


> plot3d(Henne,u=-2..2,v=0..2«Pi,scaling=constrained, grid=[40,40],shading=XY); 


EXERCISE 6.4. Find Weierstrass-Enneper representations for F(z) = 1/z and 
F(z) = 1/(42°). Plot the resulting surfaces. 


Now let’s see how MAPLE may be used to visualize and calculate in a 
specific problem. Namely, Exercise 5.2 asks you to show that, for certain 
boundary curves, Enneper’s surface, although minimal, is not the spanning 
surface of least area. The problem puts forth a candidate for smaller area — 
a generalized cylinder which is not even minimal. Here’s what MAPLE can 
do for this problem. 


> with(plots): 
> enneper:=[u-u~ 3/3+u*v 2,-v+v 3/3-v*eu” 2,u” 2-v"_2]; 


The following procedure creates a parametrization for the cylinder spanning 
the boundary curve of Enneper’s surface with R = 1.5. In order to graph the 
cylinder, we need two cases — when yv is positive and when yv is negative. 
The second coordinate yu is negative when v is between 0 and 7, so here u 
must vary from 0 to 2. When v goes from 7 to 27, u must vary between —2 
and 0. 


> CylEnn := proc(r) 

> local xv,yv,zv,n,X; 

XV := recos(v)-1/3«r~ 3*xcos(3«v); 
yv := -rxsin(v)-1/3*r~ 3*sin(3*v); 
ZV := 1 2«cos(2*v); 


VVVVV Vv 


n:= abs(yv); 
X := [xv,yv+uxn,zv] 
end: 
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Now we must plot the two cases individually. Notice again that when we 
want a plot structure to be saved for later plotting, we just give the plot3d 
command a name (such as cyll below) and we use a colon at the end to 
suppress any output (which generally would run on for pages and pages). 


> cyll:=plot3d(CylEnn(1.5),u=0..2,v=0..Pi,scaling=constrained, 
grid=[5,50)): 

> cyl2:=plot3d(CylEnn(1.5),u=-2..0,v=Pi..2*Pi,scaling=constrained, 
grid=[5,50}): 


We can get Enneper’s surface in polar coordinates together with its plot as 
follows (or from Weierpol). 


> ennpolar:=simplify(subs({u=r*cos(theta) ,v=r«sin(theta)}, enneper)); 


> enn:=plot3d(ennpolar,r=0..1.5,theta=0..2*Pi,scaling=constrained, 
grid=[5,50)): 


The following represents the Jordan curve which is the boundary curve for 
both Enneper’s surface with R = 1.5 and the cylinder above. 


> jorcurve:=subs(r=1.5,ennpolar); 

We can plot the boundary curve using the “spacecurve” command. 

> bound:=spacecurve(jorcurve,theta=0..2*Pi,color=black,thickness=2): 
Now we can display the surfaces separately with the same boundary curve. 
> display({bound,enn},scaling=constrained,style=wireframe); 


> display ({bound,cyl1,cyl2} ,scaling=constrained ,style=wireframe); 


FIGURE 7.1. Enneper’s surface inside the Jordan curve 
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FIGURE 7.2. The cylinder inside the Jordan curve 


Now let’s compute the surface area of the cylinder. I shall use the notation 
of Exercise 5.2. 
> ytheta:=subs(r=1.5,ennpolar([2]); 
> x1:=diff(subs(r=1.5,ennpolar[1]),theta); 
> z1:=diff(subs(r=1.5,ennpolar(3]) theta); 
Now we can evaluate the surface area integral numerically. The necessary 
MAPLE command is “evalf” (for “floating point evaluation”) appended to 
“int.” 
> evalf(int(2abs(ytheta)*sqrt(x1~ 2+z1~ 2),theta=0..Pi)); 


31.66323514 


This is then the area of the cylinder. On the other hand, Enneper’s surface 
has surface area which may be computed explicitly to be mr?(1 +r? + (r*+)/3) 
for radius r. Therefore, for r = 1.5, 


> evalf(subs(r=1.5, Pixr™2*(1+r~2+(r7~4)/3))); 


34.90113089 


Hence, Enneper’s surface is minimal, but not area minimizing within the 
bounds of the Jordan curve above. Of course, the surface area of the cylinder 
could’ve been (and was) computed numerically without MAPLE. Yet, what 
recommends computer algebra systems to the mathematical and educational 
community is the convenience, the simplicity of the programming relative to 
the results obtained and the beauty of instant visualization. As final exercises 
for this section, 


EXERCISE 6.5. Compare the areas of cylinders of varying radii with Enneper’s 
surface. 
EXERCISE 6.6. Now do Exercise 4.10. That is, show that the domain defined in 


the problem gives a Mobius strip lying on Henneberg’s surface. Hence, Henneberg’s 
surface is nonorientable. 


Chapter 8 


THE CALCULUS 
OF VARIATIONS 
AND GEOMETRY 


8.1 THE EULER-LAGRANGE EQUATIONS 


In previous chapters, we have seen that various geometric entities have a 
tendency to minimize some quantity. For instance, geodesics try to be paths 
of shortest arclength while minimal surfaces (including their physical repre- 
sentatives, soap films) try to be surfaces of least area. Of course we have also 
seen that there are geodesics which do not minimize arclength and minimal 
surfaces which are not least area. In this chapter we will try to put these 
results into perspective by giving a quick tour of the calculus of variations — 
the subject whose principles embody this geometric (and physical) tendency 
toward minimization. Now, whole books have been written about this sub- 
ject (see [Sag], [Wei] and [Pin] for example) and the subject itself is rife with 
essential technicalities, so we shall stick with what true variationalists call 
the naive theory. Nevertheless, we shall see that variational principles and 
methods are intimately connected to geometry. 


Here’s the general set-up for the calculus of variations: let « = x(t) denote 
a function of t with fixed endpoints x(to) = zo and 2(t,) = 21. 


z(t) a(t) 


FIGURE 8.1. Curve with fixed endpoints 


Because of this picture, we often refer to z(t) as a curve joining the end- 
points. Indeed, the function z may, in fact, be a curve in n-space, so it pays 
to keep in mind this possibility. In particular, we might like to think of a 
curve x(t) = (z'(t),...,2"(t)) as being the path of a particle with respect to 
time t. This is the mechanics point of view. One potential source of confusion 
must be mentioned right away. While we have used the standard mechanics 
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notation z = x(t) = (x'(t),...,2"(t)) above, when we deal with the geometry 
of the plane, we will use the coordinates which most of us feel comfortable 
with. Namely, we will write the independent variable as x and the function in 
question y = y(x). The context of the problem should make everything clear. 


Now, there are many choices of curves x(t) joining the given endpoints. It 
is only when we add some sort of condition to be satisfied that we can pick 
special z(t) out of this collection. For example, we have the 


Fixed Endpoint Problem: 1.1. Find the curve x = z(t) with z(to) = zo 
and x(ti) = 2; such that the following integral is minimized 


r= f° f(ta(t), (0) dt 


where f(t,x,<) is a function of t, x and = dz/dt and the latter two are 
thought of as independent variables. 


Example 1.2. Let T = 1/2mz? denote the kinetic energy of a particle mov- 
ing along the z-axis. A typical potential energy function depends on the dis- 
tance of the particle from a specified point x9, so we can write V = V(|xz—2o)). 
As we shall see later, Hamilton’s principle says that the motion of the particle 
x = x(t) will be such that the integral 


p= [t-vat= f1/ami? — Vv (|e a9) a 


is minimized. (In fact, as we shall see below, this is not quite correct. Hamil- 
ton’s principle only requires that the integral be extremized.) Because this 
example is so important, it is traditional to keep the dot notation « for deriva- 
tives in even more general situations. 


Example 1.3. We have seen before that the shortest distance between two 
points in the plane is attained by a straight line. If we use x y-coordinates, 
then the problem of determining the curve y = y(x) of minimum arclength is 


simply to 
Minimize / ylt+y’ 2 dz. 


In this case, f(z,y,y’) = V1+ yl? Also, note that here we have used the 
usual prime notation for derivatives. 


To approach the fixed endpoint problem, we should first recognize that, just 
as in ordinary calculus, our methods are best suited to finding local minima, 
not global minima. With this in mind, let’s start to analyze the problem. 

Suppose z(t) is a curve which minimizes the integral J = 
ie f(t, x(t), £(t)) dt and let x*(t) = x(t) + en(t) be a variation of z. That 
is, we think of € as being small and we require that (to) = 0 and n(t,) = 0. 


256 8. The Calculus of Variations and Geometry 


Therefore, the curve x*(t) still joins ro and x, as well as being “close” to z. 
Note too that <* = +e”. With this notation, we can think of the integral 
J as a function of the parameter ¢ 


ti ty 
ie) = | f(t,a*,e*)at = | f(t,z +e€n,£ +€) dt. 
to to 


In order to recognize the minimum z(t), just as we do in ordinary calculus, 
we take the derivative of J with respect to € and note that, since z(t) is a 
minimum by hypothesis, this derivative is zero for € = 0. (Note that, speaking 
informally, we can take the derivative inside the integral because the integral 
is taken with respect to ¢ and ¢ is independent of ¢.) Now, 


dJ__ f" Of du* | Of dx" 


de J, On Oe | On" Be 
by the chain rule, so 
GE [OF OF 
de a On Oa? * ope 
with a 4 of af 
0= ae = -[ ane aga 


since, at e = 0, x*(t) = x(t). Now, the second term inside the integral may 
be integrated by parts as follows. Let 


and compute 


ti t 
of f% d (of 
1 Ot Tot (54) 


to 


ti d Of 
=~ [5 (3) # 


since n(to) = 0 = n(t,) implies the vanishing of the first term. Putting this in 
the equation above, we have 


OF f 
sls to dx -[ na (35) - 


(ola @)* 


ty of 
/ ag = 


0 


(*) 
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This equation must hold for every function 7 with n(to) = 0 = n(t1). How 
can this be? The following exercise provides an answer. 


EXERCISE 1.1. Suppose that a continuous function y = f(z) is positive at a point 
zo. Show that it is possible to choose a function n(x) so that on some interval [a, b] 


b 
[ n(x) f(x) dx > 0. 


Hints: (1) draw a graph of f(x) about xp and say something about the values of f(z) 
for all points x near x; (2) create a “bump” function n(x) which guarantees the 
positivity of the inetgral above by drawing its graph on the same axes as the graph 
of f(x) near x9; (3) remember that positive integrands produce positive integrals. 


The exercise tells us that there is only one way equation (*) can hold for all 
choices of 7. Namely, we must have 


af 4 (af) _, 
3 - a (ge) 


This equation is called the Euler-Lagrange equation and it gives us a necessary 
condition for z(t) to be a minimum. 


Theorem 1.1. If x = x(t) is a minimum for the fixed endpoint problem, 
then x satisfies the Euler-Lagrange equation. 


Notice that we are not saying that a solution to the Euler-Lagrange equa- 
tion is a solution to the fixed endpoint problem. The Euler-Lagrange equation 
is simply a first step toward solving the fixed endpoint problem, akin to finding 
critical points in calculus. Nevertheless, because we deal with an unimagin- 
ably huge collection of possible solution curves for the fixed endpoint problem, 
the Euler-Lagrange equation is a powerful tool which is indispensable. Indeed, 
it is sometimes the case that only solutions to the Euler-Lagrange equations 
may be found with little or no other information to guide us to a solution of 
the fixed endpoint problem. For this reason solutions to the Euler-Lagrange 
equation are given the special name eztremals and the fixed endpoint prob- 
lem, for example, is often rephrased to say that a curve x(t) is desired which 
joins the given endpoints and extremizes the integral J. In this case, solutions 
to the Euler-Lagrange equation solve the problem. 


EXERCISE 1.2. Suppose that J depends on two functions z(t) and y(t), 


£1 
r= | s,2(0), u(t), 2(0),0@) a 
to 
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Show that variations z* = x + en and y* = y + «7 lead to 


_aJ 
de | 0 


_ = Of O% , OF by | OF YY, 
as Ox Oe 0% Ge | Oy De OY Ge 


OF OF. OF... OF 
-[ dn! t agit Bet oe 


. f" [af Of 1 | Of Of 
~ fo n[ae~ a (se) +L, + lo a Cae) 


and that this equation can hold for all 7, 7 if and only if the following Euler-Lagrange 


equations hold, 
Of _ a (Of\_, Of _4 (of 
Ox 0a Oy oy 


EXERCISE 1.3. Suppose that J depends on two independent variables ¢ and s, 


r= ff fe,.2(t,s),20(t,9),20(t8)) dede, 


Show that a variation z*(t,s) = x(t,s) + en(t,s) with nlc = 0, where C is the 
boundary of the region of integration R, leads to 
dJ 0 0 
=— =| OF Oe Lae Te 
de 6 


Ozrt OxLs 


Further, recalling Green’s theorem f —P dt+Qds = f f 0Q/0t+0P/0s dt ds, letting 
Q=n(0f/Ozrt), P = n(Of/Oxs) and using nlc = 0, show that the last two terms 
of the integral give 


a* f 
[feere Neel te -f fn oe es 


and substitution then gives 


O° f af 
a [hn E 7 a = i. eee 


Finally, argue that this implies that the Euler-Lagrange equation for two independent 


variables is 
af a (af\ a (af) _ 
Ox Ot \ Oxt Qs \dzs] 
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EXERCISE 1.4. Suppose f(t, z,£) = f(x, £2) does not depend on t explicitly. By this, 
we mean that f only depends on ¢ through the curves x and «, so that Of /Ot = 0. 
Show that a nonconstant z(t) satisfies the Euler-Lagrange equation if and only if 


(ieee 


OL 


where c is a constant. Hint: compute the derivative with respect to t of the left- 
handside. Don’t forget the chain rule. 


EXERCISE 1.5. Suppose f(t,z,y,2,y) = f(x,y, 2, y) does not depend on t explicitly. 
By this, we mean that f only depends on ¢ through the curves z, y, x and y, so that 


Of /Ot = 0. Show that nonconstant x(t), y(t) satisfy the Euler-Lagrange equations 
if and only if 


where c is a constant. Hint: compute the derivative with respect to t of the left- 
handside. Don’t forget the chain rule. 


EXERCISE 1.6. Show that, if O0f/Ox = 0, then z(t) satisfies the Euler-Lagrange 
equation if and only if Of /Oxz = c, a constant. 


EXERCISE 1.7. Show that the integrals 
[tesa and [tena Le) aot’ dg(t, &) dt 


have the same Euler-Lagrange equation. In particular, note that when g(t, x) = ct, 
with c constant, then f f dt and f f +cdt have the same Euler-Lagrange equation. 
We will use this fact when we discuss Jacobi’s theorem in Section 8.5. 
EXERCISE 1.8. Find the extremal x(t) for the fixed endpoint problem 

i ee [<9 

=f x sint+ —2* dt 
0 2 
with «(0) = 0, z(r) =z. 
EXERCISE 1.9. Find the extremal z(t) for the fixed endpoint problem 
1 
J= i a” + &? dt 

0 
with 7(0) = 1, x(1) = 4. 
EXERCISE 1.10. Find the extremal x(t) for the fixed endpoint problem 


nm /2 
r= a? — a dt 
0 


with x(0) = 0, x(7/2) = 1. 
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EXERCISE 1.11. Consider the fixed endpoint problem with x(to) = xo, z(t1) = 21 
and integral to be minimized 
ee, ae 
j= f pa? +q° x" dt 
to 


where p = p(t) and q = q(t) are arbitrary smooth functions. 


(1) Show that if c = x(t) is an extremal for J, then 
J =p? x ay} = p* xa(t1) — p? a(t). 

(2) Show that if 2 = x(t) is an extremal for J and 7 = n(t) is a function with 
n(to) = 0 = n(t1), then 


~ t1 
Tay pent+g azndt=0. 
to 


(3) From (1) and (2) show that an extremal x(t) for J is a solution to the fixed 
endpoint problem. That is, z(t) minimizes J. 


Hints: For (1), differentiate p* xz with respect to t. For (2), compute f p zndt by 
parts. For (3), vary x by x + 7 (where the usual ¢€ is absorbed into 7) and compute 


~ t1 
5=f p? (e+)? +47 (e+ 1)? de. 
t 


[e) 


Compare J and J. 


EXERCISE 1.12. In the fixed endpoint problem, suppose the time ty is fixed (as well 
as to and z(to) of course), but the final position x(t;) is undetermined. From the 
derivation of the Euler-Lagrange equation, show that not only the Euler-Lagrange 
equation must be satisfied, but that the extra condition 


oF (ty) =0 


must be satisfied as well. Hint: everything in the derivation works as before except 
that, after integrating by parts, the first term no longer vanishes. Argue that the 
Euler-Lagrange equation is satisfied because the eventual extremal may be consid- 
ered as an extremal for the fixed endpoint problem with x(t;) being whatever point 
the extremal hits at t = ¢1. 


Remark 1.4. The preceding exercise hints that not all variational problems 
have fixed endpoints. In fact, there are many problems where either the final 
“time” t¢, or the final “state” x(t) are undetermined. As the exercise sug- 
gests, in these cases the Euler-Lagrange equation holds together with an extra 
condition which is sometimes called a transversality condition. For example, 
suppose we say that, while ¢, is unspecified, it is required that x(t) lie on a 
given curve a(t). The generalization of the condition above is then that the 
following condition (see [Pin] or [Sag] for example) must hold for an extremal 
x(t) at ti, 


flr) + (a(tx) ~ (4) A(t) = 0. 
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EXERCISE 1.13. When either t; is specified or x(t1) is specified, the preceding 
discussion is made simpler. 


(1) Suppose t; is specified, but x(t;) must only lie on a(t). Show that the 
transversality condition above reduces to 


of 7 
aq (1) = 0. 


Hint: factor out 1/a(¢1) and note that t; fixed makes a(t) a vertical line. 


(2) Suppose x(t1) is specified, but t, is free. Show that the transversality con- 
dition above reduces to 


F(tx) ~ (tr) S£(tr) = 0. 


Hint: 2(t1) fixed makes a(t) a horizontal line. 


8.2 THE BASIC EXAMPLES 


No discussion of the calculus of variations would be complete without men- 
tioning the problem which launched the subject, the brachistochrone. The 
name is taken from the Greek words brachist, which means shortest, and 
chronos, which means time. The problem itself is this: 


Example 2.1: The Brachistochrone Problem. Given a point (a,b) in 
the x y-plane, find the curve y(z) joining (a,b) and the origin so that a bead, 
under the influence of gravity, sliding along a frictionless wire in the shape 
y(z) from (a,b) to (0,0), minimizes the time of travel. 


0 0.5 1 1.5 2 2.5 3 


-1.5 


FIGURE 8.2. Bead on a wire 
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The key to setting up this problem is the simple formula D = RT, distance 
equals rate times time. The “infinitesimal” distance travelled by the bead is 
just the arclength of the wire, D = V1+y’ 2. The rate of descent may be 
determined by noting that potential energy is given by the formula mgh where 
m is the mass of the bead, g = 9.8 m/sec? is the acceleration due to gravity 
(near the surface of the Earth) and h is the height of the bead above a fixed 
reference height (which is often taken to be zero on the Earth’s surface). If we 
start the bead at (a,b) with initial velocity zero, then conservation of energy 
requires that the kinetic energy of the bead be equal to the loss of potential 
energy due to decreasing height. In other words, 


1 
mg(b—y) = 5m 


where v is the speed of the bead. We then have that v = ,/2g(b—y). So, 
D = RT tells us that time T is a function of y and 7’, 


1 1+y/ 
Ply ay y= 
= TG oy 


The total time is then found by integrating T(y, y’) with respect to z from 
a to 0. The problem of the brachistochrone now becomes the fixed endpoint 
problem 


OP J y2 
Minimize T = efoe / Vie dx 
V29 Ja VO-y 
with y(a) = b and y(0) = 0. 


To solve this problem, first substitute u = b — y and ignore the constant 
factor 1/,/2g to get a new integral to be extremized, 


=f vi+u" , 
73 we 


Because the independent variable does not appear explicitly in J, we may use 
Exercise 1.4 to get 


Jitu? , u’ 


vu ° \/u(1 + u’?) 


Now, finding a common denominator, simplifying and replacing 1/c by a new 
constant c gives a separable differential equation c = u({1 + wu’ ) which leads 


to 
z= f vu du 


= ¢. 


c—uU 


=-2 f V@-waw 
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where w = Vc — u and c is replaced by c? 


= -2 f ccos6 ecos6 dé 


where w = csin@ 


= —2c? / cos” 6 dé 


wane [88 a 


a 
= 5 [26 + sin 26] +r 


and the final answer 


xz(?) = k(¢+singd) +r 


where 20 = ¢. Now, b—y = u = c*cos”6@ = c* cos? £ by the substitutions 
above, so again using cos g = (1+ cos ¢)/2, we obtain 


y(d) = k(1+cos¢) +b 


where k = —c*/2 as for z(¢). Note that y(7) = b, so that it must also be true 
that z(7) = a. This implies that r = a — kr. If we let ¢ = @—7, then we 
obtain the parametric formulas for x and y, 


x(¢)=k(¢—sin¢)+a and y(¢)=k(1—cos¢) +). 


This is the parametrization of a cycloid. Hence, the solution to the problem 
of the brachistochrone is one of our standard curves from Chapter 1. Also, 
note that, by Exercise 1.1.6, the brachistochrone is the tautochrone as well. 
Finally, for the brachistochrone with friction, see [HK] and for a very different 
approach, see [L]. 


EXERCISE 2.1. Johann Bernoulli solved the brachistochrone problem ingeniously 
by employing Fermat’s principle that light travels to minimize time together with 
Snell’s law of refraction (see [Wei]). Here’s a problem in the same vein. Suppose that 
a light photon moves in the upper half-plane to minimize time. Suppose also that 
the plane is made of a medium whose properties imply that the speed of the photon 
is always proportional to its height above the z-axis. That is, v = ky for k > 0. 
What path does the photon take? Hint: set up the time integral and extremize it. 
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Example 2.2: Shortest distance curves in the plane. Given points 
(a,b) and (c,d) in the plane, we can ask for the curve z(t) which joins them 
and which minimizes the arclength integral 


i= a /£2(t) + y?(t) dt. 


Here we have chosen a two-variable formulation to show how these types of 
problems work. Because the integrand does not depend on either z or y, the 
Euler-Lagrange equations reduce to 


AN es ee Es eee eee 
dt a(t) + P(E) uy Je) + PO) 


Solving for the square root in each equation and setting the results equal, we 
obtain 


which gives, under integration with respect to ¢, 


=-2r4r. 
y re 


This is, of course, the equation of a straight line in the plane. Therefore, 
extremals of the arclength integral are straight lines. We will see shortly that 
these extremals are, in fact, minimizers for arclength. 


EXERCISE 2.2. Solve the shortest distance in the plane problem using one variable. 
That is, arclength is given by f J 1+y’ (x)? dz. 


Example 2.3: Least area surfaces of revolution. We have already seen 
that a minimal surface of revolution is a catenoid, so, since least area surfaces 
are minimal, this example offers another approach to the question. A surface 
of revolution has surface area given by 


A=2n fyVi+y?ae. 


EXERCISE 2.3. Use the fact that the integrand of the surface area integral above 
is independent of x to reduce the Euler-Lagrange equation to y’ = \/y? — c2. Then 
solve this separable differential equation to show y = c cosh(xz/c — d), verifying that 
a catenoid is the only minimal surface of revolution. We have already seen that a 
catenoid is least area for a given boundary only under certain circumstances, so this 
example shows again that extremals are not always minimizers. 
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EXERCISE 2.4. Let z = f(x,y) be a function of two variables. Surface area is given 


by the integral 
A= ff Visa+R ated 


Use the two-independent-variable Euler-Lagrange equation of Exercise 1.3 to show 
that an extremal for this integral is a solution to the minimal surface equation. 


EXERCISE 2.5. Let z = $(z, y) be a function of two variables. The Dirichlet integral 


is given by 
A= | [+95 dz dy. 


Use the two-independent-variable Euler-Lagrange equation of Exercise 1.3 to show 
that an extremal for this integral is a harmonic function. Dirichlet’s integral arises 
in many areas of physics and engineering (see Exercise 4.5.2). In particular, the two 
variable version above could represent the capacity (per unit length) of a cylindrical 
condenser with annular cross-sections and the analogous three variable version could 
represent the potential energy associated with an electric field. For the latter, a 
stable equilibrium is attained when potential energy is minimized, so a minimizer 
for the Dirichlet integral is identified with the potential for the field. For more 
information see [Weil]. 


Example 2.4. Hamilton’s principle is a principle of mechanics which states 
that the equations of motion of a physical system may be found by extremizing 
the so-called action integral 


y= [T-Vee, 


where J’ and V represent kinetic and potential energy respectively. Of course 
T and V may take many forms depending on the problem at hand. To un- 
derstand why such a principle might have been developed, let’s extremize the 
action for the particular one-dimensional case of a single particle moving along 
the z-axis under the influence of a conservative force field F’. Recall that this 
means that F is the (negative of the) gradient (which here reduces to a single 
partial derivative) of some potential function V(x), F = —grad V = —OV/Oz. 
(The negative sign is traditional in physics.) The particle has kinetic energy 
T = (1/2)mz?, so the action integral becomes 


J= / smi? — V (zx) dt. 


The Euler-Lagrange equation for this integral is then 


OV (x Gs. 
- M2) _ ma) =0. 
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Replacing the first term by F and carrying out the differentiation in the second 


term gives Newton’s Law 
F=mz. 


Furthermore, notice that the integrand T — V does not depend on t, so we 
may use the first integral f — «(Of /Ox) = c of Exercise 1.4 to get 


Reape 
Ox 
1 
mi’ —V — amt = 
1 
—5mt*-V=c 
1.2 
9 +V=CE 
T+V=é 
E=é 


where EF = T + V is the total energy of the particle. This calculation means 
that the total energy in a conservative force field is actually conserved (i.e. is 
a constant). The fact that both Newton’s Law and the conservation of energy 
may be derived by Hamilton’s principle indicates why it is a cornerstone of 
classical mechanics*. We will see the connection between Hamilton’s principle 
and geometry in 8.4 below. 


8.3 THE WEIERSTRASS F-FUNCTION 


So how in the world do we ever know if an extremal is truly a minimizing 
curve for a given integral J? Besides ad hoc means (as in Exercise 1.11), 
there are various sufficiency conditions for minimizers which may be used. 
None of them, however, seems particularly convenient for large problems, so 
we will stick with one which is rather straightforward. This condition is due 
to Weierstrass (whose work on minimal surfaces we’ve already met in Chapter 
1. 
Let’s say that x = z(t) is a minimizing curve with fixed endpoints for the 
integral J = f f(t,z,£)dt and & = Z(t) is another curve joining the same 
endpoints. Denote the values which has along the curves x and by J[z] and 
J(z] respectively. Then, since x is minimizing for J, it must be the case that 


AJ & J{z] — J[z] > 0. 


This condition, as it stands, is impossible to check for an extremal x against 
a comparison curve %. So, in order to compare J[Z] to J[z], we compare 


*As noted in [Wei], however, it should not be thought that Hamilton’s principle is some- 
how more fundamental than Newton’s Law. The very definition of mass is a consequence 
of F = ma and mass is certainly needed to form the integrand T — V of the action integral. 


8.3 The Weierstrass £-Function 267 


z (or more precisely, its f-value and its slope) to other extremals in that 
particular region of space. Of course, the very existence of these extremals is 
not guaranteed, so we need a definition. 


Definition 3.1. For the standard fixed endpoint problem with integral J, a 
field of extremals is a collection of extremals for J (i.e. solutions of the Euler- 
Lagrange equation) which, in a given region, satisfies the condition that, for 
each point in the region, there is precisely one extremal of the collection 
passing through it. 


Example 3.2. Consider the fixed endpoint problem with x(0) = 0, x(2) = 2 
and integral 


2 
1 
fe =~ +eb+n+4cdt. 
0 2 
The Euler-Lagrange equation gives a general extremal 
t? 
ti)= rar ct+d 


which, upon applying the initial conditions, becomes x(t) = t?/2. A field of 
extremals may be defined by using one of the constants of integration in the 
formula for a general extremal. In this way we are assured of obtaining an 
extremal for the original J. For example, take the field to be 


t? 
La(t) = ry +d. 


Now, if two of these extremals, rg, and rq,, pass through a given (t,x), then 
t?/2 +d, =t?/2 + do, resulting in d; = dz and the equality of the extremals. 
Therefore, we really do have a field of extremals. 


EXERCISE 3.1. For the fixed endpoint problem of Exercise 1.8, 
“is 1 2 
s=f x sint+—2° dt 
0 2 


with «(0) = 0, x(7) = 7, find a field of extremals. 
EXERCISE 3.2. For the fixed endpoint problem of Exercise 1.9, 


1 
s=[ ea +a? xdt 
0 


with «(0) = 1, x(1) = 4, find a field of extremals. 
EXERCISE 3.3. For the fixed endpoint problem of Exercise 1.10, 


a [2 
/ x? — x dt 
0 


with z(0) = 0, (7/2) = 1, find a field of extremals. 
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Once we have a field of extremals for a problem, we can compare the field to 
any curve joining the endpoints. More precisely, given an integral J, endpoint 
conditions x(to) = zo, x(t) = 2; and a curve Z joining the endpoints, at 
an arbitrary point (t,x) define p = p(t, x) to be the derivative of the unique 
extremal in the field which passes through (t,x). Since the function p is 
defined in terms of derivatives of extremals, it called the slope function of the 
field. In order to compare values of J along curves, we make the following 


Definition 3.3. Given a field of extremals with slope function p = p(t, x) 
associated to an integral J = f f(t, x, x) dt, the Weierstrass E(xcess)-function 
is af 

Ett, L, £,p) = FG; z, 2) ~ FG; L, Dp) t (xz a P) 5, (ts). 


Why is this £-function defined the way it is? Here’s one rather informal 
explanation. Suppose we fix (t, x) and consider f as a function of the variable 
x. Taylor’s theorem for f at the point (t,x, p) gives 
; Of O° f 
f(t, 2,4) = f(t, 2, p) + PSP) g ie) + (¢—p)? Op ana (Gb Eye) 
where the second partial is evaluated at (t,r,r) between (t,x, z) and (t, x, p). 
If we subtract the first two terms of the righthand side from both sides of the 
equation, we obtain 
0’ f 
E(t, 2,2, p) = (& — p)?— (tz, f). 
(t,x, &,p) = (£—p) Dp? | r,t) 
If we knew, for example, that the second partial of f with respect to p was 
always positive, then we could say that the E-function was always positive as 
well. Furthermore, if we integrate the E-function (evaluated on an arbitrary 
curve £), we see that there is a natural splitting 


| 80,2,2,p) at = / r(t,,3) ~ f(,8,p) — @-nF 


=f sea.ajar- f s,3,0)+ @-wSE 3,0) ae 
= Jz] — K[z] 


(t, 2, p) dt 


where KZ] is the integral 


é = [ e804 @-yZt,3n)ae 


Now, 
(1) if E were always nonnegative (so that f Edt > 0 as well) 
(2) and if K[z] = J{z] for the extremal of the fixed endpoint problem z, 
then we would have 
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AJ = J[Z] — J[e] 
= J(z| — K (a) 


= feat 
> 0 


and x(t) would be a minimizer for J. We will prove that (2) always holds. In 
fact, we will now show that K is path independent in the sense that any curve 
Z(t) produces the same value for K[z]. For this reason — and because David 
Hilbert discovered this integral kK — K is called Hilbert’s Invariant Integral. 


Lemma 3.1. The integral 
a a as Of nx 
K{[z] = f F.3.n)+ @- Uap) at 


is independent of the path Z(t) joining the endpoints xo and 2. 


Remark 3.4. Notice right away that K|z] = J[z] for an extremal x since, in 
this case, p = Z. 

Instead of proving the lemma directly, we are going to consider the two- 
variable case because it has its uses in geometry and is rarely presented explic- 
itly. Consider the fixed endpoint problem with endpoint conditions x(to) = Z0, 
y(to) = yo, (t1) = 21, y(t1) = yi and integral to be minimized 


t1 
J{z,y| = f(t, x, y, £, dy) dt. 


to 


Suppose that (x,y) = (x(t), y(t)) is an extremal for the problem. Further, 
assume that a field of extremals exists with two-variable slope function 


def,;. . 
p= (p1, P2) = (x,y). 

The associated Weierstrass --function is 
Ett, z, y, @, Y, P1, P2) = f(t, z, V525) an f(t, 2, Y, P1, po) 

A OP gins oe Of 

— (€ — pi)=—(t, 2, 9, p1, po) — (Y¥ — po t,£,Y, P1, P2) 

@- pr) 5—( )- @- pa) 5a Ut 

and Hilbert’s invariant integral is 


as bas te = 0 De 
Kz, y] acai hha. ena 


+(y ~ pst Lt, L,Y, P1, po) dt 


Of of of Of 
= | f-p— —-m—dt dz+ — dg, 
[rn 2h mans Dips “Y 
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where we have used 7dt = d¥ and Ydt = dg. The last equality turns K 
into a line integral and we can now use the standard fact about line integrals 
that they are independent of path (in a simply connected region) if and only 
if the associated vector field has a potential function. Recall that a vector 
field on an open simply connected region O C R® is a mapping F: O — 
R’, F(t,z,y) = (A(t, z, y), B(t, z, y), C(t, z, y)) and a line integral of F along 
a path a(t) = (t,z(t),y(t)) with differential da = (dt,dz,dy) is given by 
ie F-da= f Adt+ Bdzx+Cdy. The vector field F' has a potential function 
if 


ie (A, B,C) = grad @ — (dt, dx, py); 


where grad denotes the gradient and ¢, etc. denotes partial derivative with 
respected to the subscript variable. Clearly then, the condition for having a 
potential function is 


04_ 8B 0A _8C 9B _ AC 
ae Ott (<isCiSC“‘<‘i “‘(‘<‘<‘i SSC 


In particular, for K |Z, ¥] to be path independent, we need 
a a 
O(f-prap — Pan) _ 2 OF 
Ox ~ Ot Op, 


_ OF Pf dm , OF Ope 
~ Op, Ot Op? At — Ap, Ope Ot 


a 
a(f - ri gt — mgt) a Of 
OY ~ Ot Apo 
Pf Pf Ap Of An 


~ Op2dt | Op? Ot | Op\Op2 Ot 


i eens 
Op,0y — Ap,dz” 


The third condition is a condition on the field of extremals and we will make 
the tacit assumption it always holds. Such a field of extremals is called a 
Mayer field. See [Sag] for an in-depth discussion. Note that, in the one- 
variable situation, the third condition holds vacuously. In the following, we 
shall prove that the first condition holds. The second condition follows simi- 
larly. Also, in the proof below we will use the “hatless” notation x instead of © 
for the sake of simplicity. This does not mean however that z is an extremal, 
but rather that we are interested only in formal partial derivatives without 
regard to particular curves. 
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Theorem 3.2. K[Z, 4] is independent of path. 


Proof. The proof is a tedious, but instructive exercise in the chain rule. Let’s 
carry out the differentiations listed above for (1), set the quantities which we 
want to be equal equal to each other and search for a true identity. Case (2) 
will follow similarly. 


(6) o 
a(f-m3h—mdt) ay Of Of Opi O?f , Ope 
Ox ~ dx?! |acap, Op? Ox | Opidpa | Oz 
pee Sita tte 
OyOp, Opt Oy = Op, Op2 By 


which we set equal to 


SOF OF Opi OF Opa 
- Op, Ot Op? Ot Opi 0p2 Ot 


which, by simplifying and isolating Of /Ox gives 


of _ OF  Pfdm , Of dm, Of 
dx dtp, | dp? Ot | ApiOp, Ot |” axdp, 


0? f 0? f Op; Of Ape 
+ P25 ap, +P de3 Or” Bpidpy Ox 
pg ht ee 
se Op? dy P21 Ope Oy 
0? f 0? f af | Pf (On Op, Ps) 
~ Bt0p, * ” azap, +? Bydp, + Bp? & PY og | PP by. 


Of (Spa, Ope, Opa 
Opidp2 \ ot |?) ax | Pay)’ 


Now, along an extremal, p; = x and p2 = 4, so differentiation produces the 
following general relationships among p; and po. 


. _ Opy Op. Op, . _ Ope Ope Ope 
Pi = + Pi Dz + p2 Oy P2 = + pi Dr + p2 ae 
Substituting in, we obtain 
of PF OF or mr. oy 
Ox OtOp, ke OxOp, se OyOp, Op? ie Op Ope” 


Note that this is an expression in formal partial derivatives of f and in com- 
ponents of the slope function associated to a field of extremals. We can find 
another relationship between p; and p2 by using the fact that they are associ- 
ated to extremals. Namely, along an extremal the Euler-Lagrange equations 
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hold, so we have (upon replacing the usual ¢ notation by p,) and carrying out 
the chain rule, 

O [ Of 0 f 0? f 0? f 0? f Ot», Of 

Pe =. 

Op10p2 Ox 


dt Op, ) ~ Btn, | dedp, >? Bydp, * Be? T 


which is true by the Euler-Lagrange equation. Comparing this equation with 
the one above, we see that they are identical, so we have found the required 
identity. Therefore, working backwards up the chain, we see that all the 
equations are true. Hence, we have proven condition (1) and (since (2) follows 
the same way) K is independent of path. 


Corollary 3.3: The Weierstrass Condition. If a trajectory (x,y) = 
(x(t), y(t)) is a member of a field of extremals for a fixed endpoint problem 
with integral J and 


Ett, L,Y, L, Y,P1,p2) > 0, 
then (x,y) is a minimizer for J. 


Proof. Although we have explained what the proof should be earlier, we repeat 
it here for convenience and completeness. Let £ denote any other curve joining 
the endpoints. Again, recall that K[(z,y)| = J[(z,y)] for an extremal (z, y) 
since the terms involving (4 — p,) and (y — p2) vanish. 

= J{(z,y)] - K[(z,y)] 


since K is invariant, so 
AJ = | Be2.utdyspip2) dt 
>0 
since & > 0. Hence, J[(Z,¥)| > J[(z, y)]. O 
EXERCISE 3.4. In the case of one variable x, show that K[z] is independent of 


path. Then show that if z is a member of a field of extremals and E(t,x,2,p) > 0, 
then x is a minimizer. 


Example 3.5. Consider the fixed endpoint problem with x(0) = 0, x(2) = 2 
and integral 


2 
i) etme tated 
ie) 
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An extremal is x(t) = ¢*/2 and a field of extremals may be defined by zq(t) = 
t +d. Let’s compute the Weierstrass E-function. 


2 


: 1, ; . dl : 
E(t,x,¢,p) = 5 +ab+n+%—-5p'—ap—x—-p-—(*t—p)(p+z2+1) 


1 
ge — oP —Xpt+p 
Ba 1 
ae et 
Poe 
= 5 [2° - 2¢p +p’) 
1 
=5(@-9) 
> 0. 


Hence, x(t) = t?/2 is a minimizer for J. 


Example 3.6: The Brachistochrone. Assume that the brachistochrone 
problem has a field of extremals. Then, the E-function has the form 


rr a MES) 
_ Vita? J1+p? — (1+?) — pul +p? 
7 u(1 +p?) 
= L+ul? /1+ p? —1—pu' 
7 u(1 + p?) 
_ 10,¥)NG,p)| = uv) - (2) 
u(1 +p?) 


where | | denotes length and - denotes dot product. We then have 


pa wUG p= Gu) Gp) , 4 


Ju(l + p?) 


by the Schwarz inequality. Hence, the cycloid is a true minimizer for the 
brachistochrone problem. Compare [HK] and, especially, [L]. 


EXERCISE 3.5. We have seen that straight lines are extremals for the arclength 


integral 
J= [V#o+70 dt. 
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Clearly, by simply translating the line up and down we obtain a field of extremals. 
Compute the two-variable Weierstrass £-function and show that straight lines really 
do give the shortest distance between two points in the plane. Hint: follow the 
brachistochrone example. 


EXERCISE 3.6. Show that the Weierstrass E-function is always nonnegative for the 
extremal of the surface of revolution of least area (see Example 2.3 and Exercise 
2.3). The reason that the catenoid does not always minimize area is that, beyond a 
certain point, it no longer sits in a field of extremals. 


EXERCISE 3.7. For the fixed endpoint problem of Exercise 1.8, 
a 1 .2 
s= | x sint+—2* dt 
0 2 


with x(0) = 0, z(x) = 7, compute the Weierstrass £-function to show that the 
extremal is a minimizer. 


EXERCISE 3.8. For the fixed endpoint problem of Exercise 1.9, 
1 
J= | ta” + a" adt 
0 


with «(0) = 1, z(1) = 4, compute the Weierstrass E-function to show that the 
extremal is a minimizer. 


EXERCISE 3.9. For the fixed endpoint problem of Exercise 1.10, 


n/2 
/ az? — a dt 
0 


with z(0) = 0, z(z/2) = 1, compute the Weierstrass E-function to show that the 
extremal is a minimizer. 


EXERCISE 3.10. For the fixed endpoint problem, 


1 
i. t? — 47 dt 
0 


with z(0) = 0, z(1) = 1, compute the Weierstrass E-function to show that the 
extremal is a maximizer. 


The Weierstrass E-function provides a powerful tool for identifying mini- 
mizing curves. Nevertheless, it does not provide a panacea. For one thing, 
we are usually able to show E > 0 only when some sort of special algebraic 
structure is present. In particular, we usually prove E > 0 for an arbitrary 
variable p as opposed to actually using specific properties of a particular slope 
function p. This is so simply because information about p rarely helps. There- 
fore, while it may not be hard to show E nonnegative for relatively simple 
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problems, for more complicated problems, the algebraic complexity goes up 
exponentially. Secondly, the existence of a field of extremals is by no means 
something which is guaranteed. Indeed, there is a Jacobi differential equa- 
tion akin to that of Chapter 5 and a notion of conjugate point which detects 
whether an extremal sits in a field. Specialized to geodesics, this explains why 
going past a conjugate point on a geodesic implies that the geodesic is not 
minimizing. This general Jacobi condition is too complicated to be presented 
fully in our naive context, but the interested reader should see [Pin] for an 
elementary (and brief) discussion, as well as [Bli], [Ew] and [Sag]. We will 
only present a sketch of the situation for geodesics in the following example. 
For details, see [Hsi]. 


Example 3.7: The Jacobi Equation Revisited. 

Suppose that the v-parameter curve a(v) = x(0,v) is a shortest distance 
geodesic joining two fixed endpoints and the u-parameter curves x(u, vo) are 
geodesics orthogonal to a. In fact, it can be shown that such a patch always 
exists in a region near a and, in this case, the parameters u and v are called 
Fermi coordinates. By taking v to be the a arclength parameter, we obtain 
the metric coefficients E = 1, F =0 and G>0 with 


G(0,v) =1 and = G,, (0, v) = 0. 


The first equality follows from the fact that the geodesic a = x(0, v) is param- 
etrized by arclength, so G(0,v) = x,-xXy = a@’- a’ =1. The second equality 
follows from the formula developed for geodesic curvature in Theorem 5.4.1. 
We have k, = 0 since a is a geodesic and the angle from a’ to x, as well as the 
metric coefficient E = 1 are constants. Hence, G,(0,v) = 0. The equalities 
above are saying that, along the curve a, the metric looks like a Euclidean 
metric up to first order derivatives. This allows us to carry out a “second 
variation” below as if we were in the plane. 


EXERCISE 3.11. Use the equalities G(0,v) = 1 and Gy(0,v) = 0 to show that, 
along the curve a = x(0,v), the Gauss curvature of the surface is given by K = 
—Guu(0,v)/2. Hint: use Theorem 3.4.1. 


Now let’s choose a variation of a of the form x(en(v), v) where € is small and 
n(a) = 0, n(b) = 0 (and a(a), a(b) are fixed endpoints on a). The arclength 
integral is then 


b 
Jer / e2n!? + Glen(v), v) dv 
with e-derivative equal to 


_ [ 2en’? + Gun 
=0 a 2 


dJ 
Oreae 
° en!” + G(en(v), v) 


e=0 
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since we assume a@ to be shortest distance. Also, since a gives a minimum for 
arclength, the second derivative of the arclength function (at « = 0) must be 
nonnegative. Therefore, we obtain d?.J/de? = 


*[2n/? + Gun] Ven’? + G — [2er/” + Gun|[2en!? + Gunl/2Ven? + , 
I en? +G . 
_ f? 220! + Guun?|le?n'” + G] — Ren!” + Gunl? , 
7 i 2(€2n!? + G)3/2 - 


Upon evaluation at € = 0, we get 


PI) [ 2[2n!? + Guu(0,v)n?] G(0,v) -— G30, v)n? 
de?|_. Ja 2G3/2(0,v) 


b 
’ i Qn!” + Guu(0,v)n? do 


since G(0,v) = 1 and G,,(0,v) = 0. If a truly is a shortest length geodesic 
joining the endpoints, then, as a minimum of the function J(e), the integral 
above (i.e. the second derivative of J) must be nonnegative for all choices 
of n with n(a) = 0 = n(b). But the integrand of J’s second derivative is a 
function f(v, 7,7’) and so an 7 giving the minimum of the integral (which we 
want to be zero) must satisfy the Euler-Lagrange equation for this integral. 
Thus, we arrive at Jacobi’s secondary variational problem. Namely, we must 
calculate the Euler-Lagrange equation of J’s second derivative. 


d 
2Guu(0,v) 7 — ae (4n') = 0 


Guu(0, v) 
My UU ? 
qu 2 
n + K(0,v)7 =0 


since Gy,,(0,v) = 2K by Exercise 3.11 above. Of course this is the Jacobi 
equation of Chapter 6. Now, here’s the connection between solutions of the 
Jacobi equation and geodesics of shortest length. Recall that a point c € [a, b] 
is conjugate to a if there is a (not identically zero) solution of the Jacobi 
equation 7 such that n(a) = 0 and 7n(c) = 0. (We assume c is the first such 
point in the interval [a, b].) We shall show that the existence of a conjugate 
point implies that there is an 7 for which the second derivative is negative 
2 

A): <0. 

de? e=0 
By the discussion above, this means that a could not possibly be a minimum 
for the arclength function. In other words, a geodesic is shortest length only 
up to the first conjugate point. (Technically speaking, we can actually only say 
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that, for all curves sufficiently close to the geodesic, the geodesic is shortest 
length only up to the first conjugate point.) 

To prove that a solution 7 to Jacobi’s equation with n(c) = 0 gives a 
negative second derivative, we must use a technical tool from the calculus of 
variations called the Weierstrass-Erdmann corner condition (see [Pin] or [Sag] 
for example). This condition says that, along a piecewise smooth extremal 
a(t) (i.e. an extremal which is smooth except at a finite number of corners) for 
an integral J = f f(t,z, x) dt, the partial derivative Of /Oz must be continuous 
at a corner T. That is, if we take the limit of partials on both sides of the 
corner as we approach the corner T, we must get the same answer. With this 
in mind, let 7 be a nonzero solution to the Jacobi equation 7’ + Kn = 0 
(along a) with n(c) = 0 and n(p) # 0 for all p € (a,c). Define 


_ f nlp) for p€ [a, ¢] 
é(p) = { 0 for p € |c, b]. 


For this variation € (and using K = —Gy,,/2), the second derivative is 
d? J ee 
de Meee er 
de? | 5 [ é oa 


since € = 7 on [a,c] and € = 0 otherwise. Integrating the first term by parts 
gives 


dJ 


Cc Cc 
qa = 2m! -2 f mf" dv —2 | K 7? du 
a a 


€=0 


and since 7) is zero at a and c, 


= -2 | n{n” + Kn} dv 
—0 


since 7) satisfies the Jacobi equation. So, € gives a value of zero for the second 
derivative. Could € in fact be a minimizing curve for the second derivative? 
If so, then the Weierstrass-Erdmann corner condition would have to be satis- 
fied. On one side of the corner c, the 7 derivative of J’s second derivative’s 
integrand 7/” — K 7? is 2n' while on the other side (where £ = 0), the 7 de- 
rivative is zero. Hence, we obtain 7'(c) = 0. But we already know that 7 is 
a solution to a second order linear differential equation (the Jacobi equation) 
and 7(c) = 0. By the uniqueness of solutions of such differential equations 
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with given initial data, it must be the case that 7 = 0 identically. This contra- 
dicts our assumption about 7. Therefore, the Weierstrass-Erdmann condition 
cannot hold for 7 and, consequently, it cannot be a minimum for the sec- 
ond derivative. Since 7 gives a value of zero for the second derivative, there 
must exist a piecewise smooth function 7 which makes the second derivative 
negative, 


e=0 

In fact, any corners of such a function may b& “rounded off” to produce a 
smooth such 7. The existence of such an 7 then implies that the original 
geodesic a is not a minimum of arclength for any point past c. This exam- 
ple illustrates the importance of the Jacobi equation in determining whether 
or not extremals actually minimize. Also, from the definition of the patch 
x(u,v) itself, we see that the Jacobi equation is intimately connected with 
the existence of fields of extremals. 


Besides these questions about fields of extremals, the E-function and con- 
jugate points, there is a question which we have avoided up to now. While 
the Euler-Lagrange equations provide a necessary condition for an extremum, 
there may in fact be no solution to a given variational problem. For instance, 
if we consider the plane with the origin removed and ask for the curve of 
shortest length joining (1,0) and (—1,0), then there is no solution because 
the straight line extremal joining the endpoints passes through (0,0) and, 
therefore, is not allowed. In the calculus of variations, each problem must be 
analyzed separately to determine if a solution exists. Of course, the question 
of what types of functions may be allowed as solutions arises as well. Usu- 
ally, as in Example 3.7 above, an admissible class os functions for variational 
problems is the class of piecewise smooth functions. These are continuous 
functions whose derivative(s) exist and are continuous everywhere except at a 
finite set of points where corners occur. The following exercise indicates some 
of the subtleties of the subject. 


EXERCISE 3.12. For the fixed endpoint problem with x(0) = 1, z(1) = 0 and 
1 4 1 
sta] = [ xi+x2i+-— dt, 
0 2 
a.) Show that the only extremal is z(t) = 0, so that there is no minimizer. 


b.) Show that J[z] > 0. Hint: show that J[z] = is a‘ dt. Use «dt = dz. 
c.) Show J[z] gets arbitrarily close to 0. Hint: consider the piecewise linear function 


defined by 
(t) 1l—et 0 
x£ = 
0 2 


Le Led 


<t< 
<t< 


and compute J[z] for these functions. 


8.4 Problems with Constraints 279 


There are various methods for analyzing whether extremals are minimizers, 
many of which can be found in texts such as [Bli], [Sag] and [Ew] for example. 
In particular there is a general approach which depends on the so-called second 
variation and, while we do not have space to discuss this in generality, we 
point out that Example 3.7 above and the method of Schwarz considered in 
Chapter 7.5 exemplify this method. See [DoC1] for a good discussion of the 
second variation as applied to geodesics. 


8.4 PROBLEMS WITH CONSTRAINTS 


Often, variational problems come with extra side conditions which must 
also be satisfied. For example, we saw in Chapter 4 that we can minimize the 
surface area of a compact surface enclosing a region of space subject to the 
constraint that the enclosed volume remain constant. Recall that the mini- 
mizer surface turned out to be a sphere. (For a variation of this constrained 
problem designed to describe the shape of a mylar balloon, see [Pau].) In fact, 
the most ancient of all these types of problems is the following two-dimensional 
version: given a closed curve of fixed arclength, what is the shape the curve 
should assume to maximize the enclosed area? Because of this, constrained 
variational problems are sometimes called isoperimetric (i.e. same perimeter) 
problems. In this section we shall see what modifications are necessary in our 
previous work to handle variational problems with constraints. The standard 
problem shall be, 


Minimize J = f (t,x, xz) dt 
to 


subject to the endpoint conditions z(to) = Zo, z(ti) = r1 and the requirement 


that 
ti 


i= g(t, x2, %)dt=c 


to 


where c is a constant. 

We will now derive a form of the Euler-Lagrange equation which is a nec- 
essary condition for the solution of the constrained problem. Just as be 
fore, assume x = x(t) is a minimizer for the problem and take a variation 
& = £+e(an+ bf) where n(to) = 0 = (ti) and €(to) = 0 = E(t1). We 
take two “perturbations” 7 and € because taking only one would not allow 
us to vary J while holding I constant. By taking 7 and €, we can vary J 
while offsetting the effects of one perturbation with the other in J. The usual 
Euler-Lagrange argument gives 


dele J 7+ (Gea (ae) # 


ee rs 
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Now, however, the usual argument must be modified because an + bé is not 
arbitrary. The requirement J = c puts restrictions on a and b. If we carry 
through the argument above for J, we get 


dl ft Og d (0g 
0= — = b —--|=— dt 
de ae) & di (33 
where the derivative is zero since ] = cis aconstant. But z is not an extremal 
for I, so the Euler-Lagrange equation with g does not hold. Instead, we have 


t Of d (of eee dg d (0g 
[ ce) (3 "ai &) oo, i. aS) (5 "ai (38) ig 


which produces 


e=0 


Upon rearranging we obtain 

t a a (od t a a (0 

t a a (a "et a d (a 

i é (2-2 (32)) at fin (22- ¢ (38) at 
The lefthand side is a function of € while the right is a function of 7, so 
the only way for these expressions to be identical is for both expressions to 


be equal to the same constant » (the Lagrange multiplier). Simplifying the 
expression 


gives the equation 


{o[E22-f(2)] a= 


Now the usual argument may be continued. Because 77 is arbitrary, the previ- 
ous equation can hold only if the term in brackets vanishes. We, thus, obtain 
the following Euler-Lagrange necessary condition for the constrained problem. 


Theorem 4.1. Ifx = x(t) is a solution to the standard constrained problem, 


~ Of—Ag)_ da (AFf-Ag) 
g g 
Af Ao) _ © (ALA) wo, 
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Example 4.1. The bending energy of a plane curve is defined to be the 
integral of squared curvature over the length of the curve, { «?(s) ds. Recall 
from Chapter 1 that the curvature of a plane curve also is given by 6 = d@ /ds, 
where @(s) is the angle the unit tangent T(s) makes with the z-axis. As 
we shall see when we discuss Hamilton’s principle, it is not unreasonable 
to assume that a wire will try to assume a shape which minimizes bending 
energy. (For a biological application of this idea see [Can] as well as [DH1] 
and [DH2].) Therefore, we can ask the following question. What shape will a 
wire take if the total turning of its tangent is fixed and the turning is zero at 
the endpoints? As a constrained variational problem, we write 


1 1 

; 1 

Minimize J = i 1 gp ds subject to l= / 6ds = — 
0 2 0 6 


with 6(0) = 0, 6(1) = 0. 
The factor 1/2 in the first integral is only put in to make calculations 
cleaner. Minimizing bending energy is clearly equivalent to minimizing one- 
half of bending energy. Similarly, the 1/6 value of the second integral is chosen 


only to avoid fractions later on. This said, the Euler-Lagrange equation for 
the constrained problem is —\ — 26 = 0 with solution 


6(s) = 5(-3? +s). 


An application of the constraint gives 


Hence, A = 2 and 6(s) = —s* +s. Further, a field of extremals is given by 
64 = —s? +84 and the Weierstrass E-function is E = 1/2(6 — p)? > 0. 
Hence, @ is a true minimizer for bending energy subject to the constraint. We 
have met the curve corresponding to 6 previously in Exercise 1.5.11. Recall 
that a plane curve G(s) with curvature & = d0/ds may be reconstructed by 
the formula 


B(s) = ([ cos(@(u)) du, [ sin(6(u)) iu) 


In the case at hand, 0(s) = —s? + s and we have a form of Euler’s spiral. 
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FIGURE 8.3. Euler’s spiral (the spiral of Cornu) 


Remark 4.2. The reader should compare and contrast the preceding example 
with the classical problem of the elastic rod (see [Sag]) of fixed length L. 
Suppose a rod is clamped at points (a, ya) and (b, yp») at fixed angles 0, and 
6, respectively. The shape the rod assumes at equilibrium is determined by 
minimizing the potential energy of the rod and this quantity is proportional 
to [«?(s)ds. The constraints of the problem are determined by the fixed 
angles. Namely, projections onto the z and y axes give, i cos6ds = b—a 
and i sinOds = yo — ya. Therefore, using two Lagrange multipliers, we 
extremize the integral 


L 2 
=f (=) — \;, cos @ — Ag siné ds 
0 \ds 


to obtain the Euler-Lagrange equation 


ds 


EXERCISE 4.1. Integrate the Euler-Lagrange equation with respect to s to obtain 


dd yd 
i Bag 5 = +C. 
Explain why all inflection points of the elastic rod lie on a straight line. 
If the line determined in Exercise 4.1, —Aly + Arg +C = 0, is rotated to 
become the new y axis, then the shape of the elastic rod has the expression 


2 
y= / z*/2+a roe 


/1 — (x?2/2 +a)? 


A; sin 8 — Az cos 6 — a. (25) yO 
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EXERCISE 4.2. Solve the constrained problem 
or apie ‘ ‘ : 7 
Minimize J = 5% + xx dt subject to I= zdt = — 
0 0 
with endpoint conditions z(0) = 0 and 2(1) = 0. 


EXERCISE 4.3. Solve the constrained problem 


1 1 
Minimize J = / at? — si dt subject to I= i dt 1 
0 0 


with endpoint conditions x(0) = 0 and z(1) = 1. 


EXERCISE 4.4. In the notation of this section, show that (under the assumption 
that the Lagrange multiplier \ is nonzero) a minimizer z(t) for J subject to fixed I 
is a minimizer for I subject to fixed J. 


EXERCISE 4.5. From Exercise 1.1.10 we know that a freely hanging chain of fixed 
length hangs in the shape of a catenary y = c cos(z/c +d). Show that this fol- 
lows not only by the force diagram of Exercise 1.1.10, but by energy considera- 
tions as well. Namely, the potential energy of the hanging chain is proportional to 


f ua yy 1+ y’* dz and equilibrium position of the chain is assumed when potential 


energy is minimized. The problem is constrained, however, by the fixed length of 


the chain L = f{ ne 4flty’ 2 dx. Therefore, the equilibrium shape y will be attained 
when the following integral is extremized: 


p= fw ity?-ayi+y? dz. 


Zo 
Extremize this integral to obtain a catenary. Hint: the integrand is independent of 


z. 


Example 4.3: The Isoperimetric Problem. Suppose we have a closed 


curve of fixed length L = [ \/z? + y? dt and we wish to enclose the maximum 
amount of area. Green’s theorem allows us to express area as a line integral 
f (1/2)(—y& + ay) dt so that the variational problem becomes: 


Maximize J = [orev + ary) —AvV a? + y? dt. 


The two variable Euler-Lagrange equations are 


which may be rewritten as 


a pee ern =0 a en eee. = 0 
dt y /g2 + y2 = dt /g2 + y2 = 
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We then have 
AL AY 
+e = C and —- 2+ ——=—=-D 
y | 72 +y? / pp ze y? 
and a little algebra produces 
(c—D)’ + (y-C)? =», 


a circle of radius r = X centered at (D,C). Because, for a circle, L = 27r, we 
have r= A= L/2n. 


EXERCISE 4.6. Show that the two variable Weierstrass F-function is always non- 
positive for this problem. Thus, assuming it may be embedded in a field of extremals, 
the circle provides a maximum for the integral J and is, therefore, the curve of fixed 
length which encloses the largest area. 


EXERCISE 4.7. Show by variational methods that the circle also solves the problem 
of finding the curve of shortest length which encloses a fixed area. Then show the 
same thing by the following argument. Let a be a closed curve of fixed area A and 
arclength L. Since the circle solves the problem of finding a curve of fixed length 
having maximum area, we must have A < L?/4m. Take a circle of area A and show 
by simple algebra that the arclength of this circle is smaller than L. Hence, for given 
area A, the circle has smallest arclength. 


Integral constraints such as those above are not the only types of con- 
straints which arise in variational problems. We shall consider one more class 
of constraints which pertain to geometry and mechanics, the so-called holo- 
nomic constraints. Consider the following situation: a marble is placed inside 
a frictionless bowl in the shape of a hemisphere and the marble is given a 
horizontal push. What is then the trajectory of the marble inside the bowl? 
There are two forces to be dealt with here, gravity and the force which con- 
strains the marble to remain in the bowl. It is a principle due to D’Alembert 
that the force of constraint is a normal force to the bowl and so does no 
work on the marble. See [Arn] for a discussion of holonomic constraints and 
D’Alembert’s principle. What this all means is this. In order to discover the 
equations of motion of the marble, it suffices to consider the gravitational 
potential V(z, y, z) = mgz and kinetic energy T = (1/2)m(az? + y? + 2?) re- 
stricted to the surface itself. In other words, because a parametrization of the 
sphere of radius R is given by 


x(u,v) = (Rcosu cosv, Rsinu cosv, Rsinv), 


we obtain the restricted forms of V and T, 


1 ; ; 
V=mgRsinv, T= 57K? (cos vu? +0), 
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To show the latter, we can either compute Z, y and 2 directly (by the chain 
rule) in terms of % and © or we can reason as follows. The path of the marble 
is given in three space by a curve a(t) = (x(t), y(t), z(t)) with velocity vector 
a’(t) = (a, y, 2). The kinetic energy may therefore be written T = (1/2)|a’|?. 
As we have seen many times before, the chain rule gives a’ = x, U+X, 0 with 


lo’ |? = xy xy U2 + Xy xy 0? 


since the metric coefficient F' is zero. Hence, 
lv’ P= EW+G0" 
and the kinetic energy becomes 
T= sm( Ei + Gi’). 


Of course, for the R-sphere, E = cos* v and G = 1, so we obtain the expression 
for T above. What we have just done embodies the essence of the term 
“holonomic constraints.” In fact, we may say that constraints are holonomic 
when they arise by simply inserting the parametrization of the constraint 
surface into the usual mechanical quantities such as kinetic and potential 
energy. By the discussion above, the action integral which is relevant to the 
marble in the hemispherical bowl is 


1 
J= [jm (cos? vi? + 0") — mgRsinv dt. 


The Euler-Lagrange equations for this two variable problem are 


mR? cos? vt = c 


—mgRcosv — mR? sinv cos v tu? — mR?% = 0, 


or, more simply, 


eo Cc ie 5. 3G ’ * ee 
(t) Wi a at and + 7 cosy + sinucos vt = (). 


These are then the equations of motion of the marble. The trajectory corre- 
sponding to these equations (and plotted by MAPLE with g = m = R = 1) 
is shown below. 
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FIGURE 8.4. Marble in a hemispherical bow] 


EXERCISE 4.8. Find the equations of motion of a marble in a paraboloid shaped 
bowl. Take the paraboloid’s parametrization to be x(u,v) = (ucosv, usin, u?), 


We can see the influence of geometric ideas on physics if we consider the 
case of a particle constrained to a surface M: x(u,v) with zero potential, 
V =0. In physics it would be said that the particle moves inertially on the 
surface — that is, only under the influence of the constraint forces. Accord- 
ing to D’Alembert’s principle, these forces are normal to the surface, so the 
particle’s acceleration is likewise in the normal direction. In geometry we rec- 
ognize the path of such a particle as a geodesic. Indeed, we have (taking, for 
simplicity, an orthogonal patch) 


Theorem 4.2. If the potential is zero, then, according to Hamilton’s princi- 
ple, constrained particles move along geodesics. 


Proof. The kinetic energy is given by 
1 1 
T= 5m’ |? = gE wu + Gov?) 


with action J = f Tdt. The Euler-Lagrange equations (with m divided out) 


for this action are 
Eu .2 Gx is d “\ Ey .2 G, 
‘oo 5 ~ (Fu) = 0 and 5 ae 


The ¢t-derivatives are 


d 
nee ee 
ar (Gv) = 0. 


d 
a EY) = (Eyt + Eyy)u + Eu 
d 
qi (C?) = (Gyti + Gyt)v + Gi 
so, rearranging and simplifying, we obtain 
E E Gu 2 


ut seu + iad 5B” 0 
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and 


vi Ey 2 Gy ew Gy 2 
—-—1U — —v* =0. 
Vv Yee + G uv + Yeu 
By Hamilton’s principle, these are the equations of motion of the particle. By 
our work in Chapter 5, they are also the equations describing a geodesic. O 


EXERCISE 4.9. Show that the two variable Weierstrass E-function for the action 
J = { T dt is always nonnegative. Therefore, whether or not a geodesic minimizes 
the action is simply a question about the existence of a field of extremals. 


EXERCISE 4.10. If V = V(u, v) is a nonzero potential for the problem of a particle 
constrained to a surface, show that the equations of motion for the particle are 


a Eu .2 Ey .. Gu .2 
uit spe + Eu aE? = ~—Vyz 


and 


as Ey .2 Gass Gy .2 
v ag + Gq Wt sat = —Vy,. 


Recalling how the geodesic equations arose, we may write these equations as 
1 / 
Atan = Varia = —grad V, 


where the gradient of V is taken with respect to u and v. Thus, the gradient of the 
potential is the obstruction to the particle travelling along a geodesic. 


8.5 FURTHER APPLICATIONS TO GEOMETRY AND MECHANICS 


In this section we will look at geodesics as potential minimizers of arclength. 
Furthermore, we will generalize Theorem 4.2 to say that the motions of parti- 
cles are always geodesics — with respect to a metric on the surface conformal 
to the original one. We will also see how the surfaces of Delaunay of Chapter 
3.6 arise from a variational principle. 

We have said all along that, philosophically, geodesics should be curves of 
shortest arclength. We took the definition of geodesic to be a curve whose 
tangential component of acceleration vanishes, however, because this charac- 
terized lines in 3-space and seemed a more directly verifiable condition. We 
learned in Theorem 6.7.3 (and Example 3.7) that geodesics do not always 
minimize arclength, but we may still ask if they eztremize arclength. As 
usual, we assume for convenience that the metric has F = 0. 
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Theorem 5.1. Let M: x(u,v) be a surface with metric E and G. Then, 
extremals for the arclength integral 


y= | VEWTGR at 


are geodesics on the surface M. 

Proof. We calculate the two variable Euler-Lagrange equations to be 
oe - 5 ( Eu Ne 
2WEUW+Gi2 dt\VEW+Gv 
E,W? + Gy v* sal Go iE 
2EW+Gi? dt \JEw+Ge? 


Now we introduce the arclength parameter s(t) since we know geodesics must 
be parametrized to have constant speed. This means that we have the rela- 


tions q q 
se ag “2 “2 
a a VEu2+Gou 


, du dudt u , adv dvdt v 


“ds dtds VJEW+Ge | ds dtds JEW+GR 
Putting these into the equations above (and dividing out by the factor 


VEU? + Gd?) gives 


Eu 42 Gu s2 d n 
— —v ——(E£u) =0 
5 + 5 75 | u) 
Ey 72 Gy 42 d Pens: 
5 U + re a (Gv’) = 0. 
Carrying out the differentiation and simplifying produces 
Ey 2 -£E 2 
/ Uist Df Ui? 
ark jolt = =) 
“Top tpt’ op 
and 
FE. 2 G G 2 
i a Uuisre VUog 
—-— = —v =0 
eo 9@e ogre. nage 
These are the geodesic equations. O 


Remark 5.1. The Weierstrass E-function for the arclength integral is com- 
puted to be 


Blt, u,v, %,¢ VEt + Gi?,/Ep: + Gps — (Eup; + Gop) 
) U,V, U,V, ’ So a eS ee eee 
Pe VE@+OR 
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The fact that E > 0 follows because the metric E, G gives an inner product 
to each tangent plane defined by 


vow= Lvw1+ Grow? 


and the Schwarz inequality |v o w| < |v||w| holds for all inner products. 
Therefore, once again we see that the shortest distance property of existing 
geodesics depends entirely on the existence of a field of extremals. This in 
turn, recall, depends on the nonexistence of zeros for solutions to the Jacobi 
equation. 


EXERCISE 5.1. Consider the (arclength parametrized) Euler-Lagrange equations 
for the arclength integral above, 


(Bu) = Sey + Sey? 
£ Gv’) = a + ay? 


and let a(s) be an extremal (i.e. a solution curve for these equations). Show directly 
that the tangential component of a” is zero. Hints: write a’ = xu u' + xy v' so that 
Eu! = a! -xy and Gu' = a’ -xy. Differentiate the righthand sides of these equations 
and compare with the Euler-Lagrange equations. 


Now that we know that geodesics are extremals of the arclength integral, we 
can give a beautiful interpretation due to Jacobi of the equations of motion of 
a constrained conservative system. Recall that kinetic and potential energies 
have the form 

1 
T=- 
2 


da |? 


1 
| = (Bi? +G0*) and V=V(u,0), 


where a is a particle trajectory. The action integral f T — V dt is independent 
of t, so the two variable first integral of Exercise 1.5 implies that energy 
H =T-+V is a constant along paths of motion. Specifically, 


OT -V) aT -V) | 
T-V a a a Sa a 
(Bi? + G8) —V —4(B4) - (G8) =e 


1 
5 (Ew + Gt”) -V — Ew — Gt? =c 
-5 (BW? + Gv?)-V=c 
T+V=-c. 


Here and in the following we use H for energy to avoid confusion with the E 
of the metric. 
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Now let’s take a curve a: I — M representing the motion of a particle 
determined by Hamilton’s principle on a constraint surface M. We take the 
metric of the surface to be E, F = 0 and G. Because we know that the path of 
the particle must conserve energy and because this condition is incompatible 
with the unit speed condition, we must explicitly reparametrize the curve to 
have constant energy. To do this, let Hp denote the constant energy along the 
path of the particle and define 


ena Gv? dt. 
r= | D(H ~V ue + Gu 


Lemma 5.2. The reparametrized curve a(T) has constant energy. 


Proof. First, note that the fundamental theorem of calculus implies 


dr 1 
— = ————————— y/ Fr? +-:« G0 
dt \/ 2(Ho — V) 


so that the chain rule then gives 


du _ dudt _— du y3(Ho=V) 
dr dtdr ~— dt /Ew2? + Go? 
dv dv dt dv »/2(Ho — V) 


dr dt dr dt /Euw+Go. 


Using this, the following calculation then shows that this reparametrization 
conserves energy at the value Ho along the curve. 


T(r) + V(r) = 5 (0 (#) +G(r) (2) | +V(r) 


af du\? 2(Ho —V) dv\* 2(Hp — V) 
-3(#($) aaraat¢(@) Ee+on)t* 


1 (Eu? + Gi? 
Js V)- | 
(He "5 (Garren) “ 
=H)-V+V 
= Ho. 


0 


Furthermore, note that this calculation shows that T(r) = Ho — V. Also, the 
general energy equation T+V = H gives 2T-—H = 2T-(T+V) =T-—V. We 
shall use both of these relations in the following to determine the equations of 
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motion of a particle as extremals of { T(r) — V(r) dr. Note that, because of 
conservation of energy, these extremals must be found within the class of all 
curves with constant energy Ho. The first step is to observe that, from what 
we have said above, 


/ nS Ves / 2T (r) — Ho dr. 


Furthermore, since the derivative of a constant vanishes, adding a constant to 
an integrand has no effect on the Euler-Lagrange equations. Therefore, finding 
extremals for either side of this equation is equivalent to finding extremals for 


pro dr = [rc —V)dr 
= [rt = v) oat 
= [ 2 - vy)————— a V Ei? + Gi? dt 


= / AVA 2(Ho -_ V) Eu + 2(Ho — V)Gv? dt 
= if V Eu + Go? dt 


where E = 2(Hp —V)E and G = 2(Hp — V)G defines a new metric on 
M conformal with respect to the the original E and G. (Here, we assume 
that Hp > V in this region of the parameter domain.) It is now clear that 
finding an extremal for fT — V dr corresponds to finding an extremal for 
f VEU? + Go? dt. But, by Theorem 5.1, f VEu? + Gi? dt has extremals 
which are geodesics on M with respect to the metric E = 2(Ho — V)E and 
G = 2(Hyp — V)G. Therefore, we have the following special F' = 0 case of a 
result of Jacobi. 


Theorem 5.3. Let a denote the path of a particle with constant energy Ho 
under the influence of a potential V constrained to lie on a surface M with 
metric E, F = 0 and G. Then, for Hp > V, a is a geodesic on M with respect 
to a conformal metric 


E=2(H)-V)E F=0 G=2(H)-V)G. 


This theorem, while hardly the means to an explicit description of equations of 
motion, nevertheless allows theoretical results about geodesics to be applied 
to mechanical systems. In particular, one of the most important questions 
about a mechanical system is whether or not there is a periodic orbit — that 
is, does some particle, say, return to its initial position after a certain amount 
of time. By the theorem above, this question is equivalent to asking whether a 
surface M (with any metric) has a closed geodesic — a geodesic a: [a,b] > M 
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with a(a) = a(b). In 1951, Lusternik and Fet [LF] (also see [K]]) proved a 
general theorem which, in particular, says that any compact (i.e. closed and 
bounded) surface does indeed have a closed geodesic. Applying this result to 
a surface in conjunction with Jacobi’s theorem gives 


Corollary 5.4. If M is a compact surface and V(u, v) is a potential function 
with Ho > V(u,v) for all (u,v) in the parameter domain of M, then there 
exists a periodic solution to the equations of motion of a particle constrained 
to move on the surface under the influence of V. 


Example 5.2: The two body problem. The following comes from [P]. 
Suppose that two bodies are mutually attracted with potential V depending 
only on the distance between the bodies. Mechanics tells us that the problem 
may be reduced to the case of single body planar motion in a central force 
field. The Jacobi metric for the motion along a constant energy curve has 
conformal scaling factor (see Chapter 5.4) f = 1/,/2(Ho — V) so that Exercise 
5.4.1 gives the Gauss curvature, | 


Ll Vig Von V2+V? 


N= Gp =v (ov 


EXERCISE 5.2. Use Exercise 5.4.1 to verify the formula for Gauss curvature above. 
This curvature is called the mechanical curvature. Further, show that if V has a 
local minimum at (uo, vo), then K (u,v) > 0 for all (u, v) sufficiently close to (uo, vo). 
Similarly, K(u,v) < 0 near a local maximum. 


EXERCISE 5.3. Let V = —1/Vu? + v2 be the Newtonian potential in a plane. 
Show that 


a 1 eee eee 
Vas + Vy ae (u2 ae v2)? and Vuu + Vov 7 (u2 + y2)3 5) 


By Exercise 5.3, the Newtonian potential V = —1/V/u? + v2 gives 


Ho Ho 


a TV testis Hor +1) 


where r = Vu? + v? is the radial distance from the origin. Now, the constant 
energy equation (1/2)v? —1/r = Ho gives v? = 2(Hor + 1)/r > 0. Hence, the 
denominator of K is positive. Therefore, 


H >0sSKkK <0 
Hj =0ekK=0 
H<0eK>0. 
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For the two body problem, orbits are characterized in terms of energy (see 
[PC]). Putting this together with the above gives 


K<0 #& Ho>0O + the orbit is hyperbolic 
kK=0 & Ho=0 = the orbit is parabolic 
K>0O #& Ho<0O $= ' the orbit is elliptical. 


Therefore, orbits are characterized in terms of their Gauss curvatures. In 
particular, a point where K > 0 guarantees a periodic orbit. 


Before we end our discussion of Jacobi’s theorem, we should point out ex- 
plicitly that everything we have said holds true in higher dimensions as well. 
Also, it should be noted that one of the main features of Einstein’s geometriza- 
tion of gravity is his assertion that particles, planets and light photons, say, 
travel along paths which are geodesics in a region of spacetime whose metric 
is determined by the mass-energy in that region. Considerable philosophi- 
cal hindsight, therefore, allows us to view Einstein’s approach as a natural 
descendent of Jacobi’s theorem. 


To end this section, we want to exhibit yet another of our earlier examples 
as a constrained variational problem. In Chapter 3.6 we considered the sur- 
faces of revolution with constant mean curvature — the so-called surfaces of 
Delaunay. Recall that these surfaces were characterized by a certain differen- 
tial equation, 
2ah 


V1+h? 


where a and b are constants. Here, h = h(u) denotes the profile curve (or 
meridian in the plane) of the surface of revolution. Let’s consider the follow- 
ing problem. For surfaces of revolution, fix a volume VY = 7 f{ h(u)? du and 


minimize the surface area S = 2m f h(u)\/1+h’ (u)* du. What are the re- 
sulting surfaces? The usual variational setup for such a constrained problem 
(neglecting 7 in the formula) gives 


h?+ = +b? 


Minimize J = [2m 1+h'(u)* — AR(u)? du. 


Theorem 5.5. The extremals for the variational problem above are the sur- 
faces of Delaunay. 


Proof. Since the integrand does not depend on the independent variable u, 
we may use the first integral f —z(Of/Oz) = c in place of the Euler-Lagrange 
equation. 
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V1+h? 
2h(1 + h’?) — \n2V1 + bh’? — 2hh! =ceV1 +h” 
2h = (c+Ah?)V1 +h” 


/ 
QhV1 +h’? — \h? — hi! Ga =¢ 


where a = —1/X and b = —c/X. Of course, this is precisely the equation above 
characterizing surfaces of revolution of constant mean curvature. OC) 


As we mentioned in Chapter 3, many one-celled creatures exhibit rota- 
tional symmetry and shapes which seem close to surfaces of Delaunay. D’Arcy 
Wentworth Thompson asked whether the minimization of surface area under 
the constraint of fixed volume together with a certain biological tendency to- 
ward symmetry might be responsible for the observed shapes. Of course we 
have seen in Chapter 4 that compact surfaces of constant mean curvature are 
spheres, so Thompson needed other biological considerations to allow for the 
physical existence of rounded-off “compactified” versions of surfaces of Delau- 
nay. More information on this approach to cellular morphology may be found 
(of course) in [DWT] as well as in [HT]. 


8.6 THE PONTRYAGIN MAXIMUM PRINCIPLE 


The subject of optimal control theory provides a sightly different perspec- 
tive on variational questions from the calculus of variations. We will only give 
a brief outline of one of the main results in optimal control — the Pontryagin 
maximum principle — and how this result may be applied in geometry. The 
true power of the Pontryagin maximum principle is seen in its application to 
discontinuous problems (i.e. the so-called bang-bang controls) such as those 
found in rocket motor control or trajectory control, but we shall only con- 
sider simple geometric applications here. For an elementary introduction to 
the principle’s nongeometric applications, see [Pin]. Also, we shall restrict 
ourselves to two dimensional problems as we have done so far. 

The main problem in optimal control is to find a path (or trajectory) x = 
(zi(t),Z2(t)) which satisfies the system of differential equations 


£1 = fi(x,u) £2 = fo(x, u) 
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(where u = (uj(t), u2(t)) is the control) and which minimizes a given cost 
function 


ty 
J= fo (x, u) dt. 
to 
Note that we have reverted to the control theorist’s subscripts in place of 
the geometer’s superscripts for coordinate functions. This should ease the 
transition to further study in optimal control. Also, a typical optimal control 
problem may well fix the initial and final states of the system, x(to) = xo 
and x(t,) = x1, as well as the initial time tp. The final time t, is typically 
undetermined. For example, if a spaceship is to make a landing on the moon 
in the shortest time, it must satisfy a system of differential equations derived 
from Newton’s Law F = ma together with the thrust control of its rockets. 
Because time is to be minimized, and starting from to = 0, the cost is J = 
i 1 dt = t1, so the final time 1, is not fixed initially, but is itself determined 
by solving the problem. See [Kir p. 247] for a discussion of this problem. 
Now, how would a variationalist approach this problem? We have already 
seen that various types of constraints may be placed on variational problems, 
so it seems reasonable to simply incorporate the system of differential equa- 
tions into the cost integral as a constraint. Because z;(t) = f;(x,u), it is 
certainly true that 


" de) (elt) — fal u)) dt = 0 


to 


for time dependent Lagrange multipliers ~;. Therefore, in order to solve 
the optimal control problem above, instead solve the constrained variational 
problem 


ty 
Minimize J= | fo + Wi(41(t) — fu(x, u)) + Yo(ao(t) — fo(x, u)) dt. 
to 


To solve this problem, we will write down the usual first integral for t- 
independent integrands, the Euler-Lagrange equations (for four functions 21 (t), 
Z(t), ui(t), w2(t)) and then take into account the endpoint conditions, ¢; un- 
determined and x(t;) fixed. 


Because the integrand of J does not involve t explicitly, we may write 
fo + v1 (41 (t) — fix, u)) + Yo(£2(t) — fo(x, u)) — f141 — foy2 = 
or, rather 


(1) fo — 1 fi — We fo =. 
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The Euler-Lagrange equations with respect to 11, Z2, u, and u2 are 


(2) b= 58 58 — 52 
(3) in= 52 - iso - 52 
(4) = 58 54-52 
(5) = 58 54 52. 


EXERCISE 6.1. Verify the Euler-Lagrange equations above. 


In Remark 1.4 we mentioned that the general condition for extremizing 
J = f f dt subject to the terminal endpoint at t; lying on a given curve a(t) 
is 


(ts) + (@(tr) ~ a(t) E(t) = 0. 
In our case a(t) = x(t,), so @ = 0 and we have 
Fits) — #1) S(t) =0. 


We must use a multivariable version of this condition of course. Noting that 
the integrand of J does not depend on w; or ug and using F = fo+q1(41(t) — 
fi(x, u)) + Yo(L2(t) — fo(x, u)), we obtain an endpoint condition 


OF OF 
F(t1) —4 Pally 6 ee —(t;) = 
(t1) — £1 (t1) Dn (t1) — £2(t1) Diy (ti) =0 
which is then written out as 


(6) fo(t1) — oi (t1) fi(ti) — Yo(ti) fo(ti) = 0 


Now, for convenience, define the Hamiltonian associated to the problem to be 


H=-fotdifit ye fe 
and note that equations (1) — (6) imply the following relationships. 


(i) H(t, x, u) = 0, by (1) and (6). 
é OH . OH 
(ii) a ee 


(iii) OH/Ou, = 0 and 0H /Ou2 = 0 along an optimal trajectory. 


8.6 The Pontryagin Maximum Principle 297 


This heuristic discussion may be strengthened considerably to give the follow- 
ing necessary conditions for optimality. 


Theorem 6.1: The Pontryagin Maximum Principle. Suppose u(t) is a 
control which transfers the system with 


State Equations £1 = fi(x,u) Lo = fo(x, u) 


from fixed state xo to fixed state x, along a path x(t). If u(t) and x(t) 
minimize the cost : 
1 
fo(x, u) dt, 
to 
then there exist ~, and wz so that the Hamiltonian H = —fo+%1 fit 2 fe 
obeys 


(i) A (1, 2,21, 22, Ui, U2) = 0 along the optimal trajectory. 


. OH OH 
ii) Co-state Equations =—->— =—-—. 
(ii) q V1 Bis 2 an 
(iii) For each t, H attains its maximum with respect to u at u(t). In 
particular, 


OH OH 
7 ae Bia” 


Example 6.1: Geodesics. Suppose we want to find geodesics on a surface 
M: x(u,v) with orthogonal metric FE and G. We can formulate the problem 
by saying that, from a starting point x9, we wish to control our journey 
to minimize arclength. Because we can write a curve on M in the form 
a(t) = x(u(t), v(t)), we see that we are really interested in controlling the 
functions u(t) and v(t). Therefore, the optimal control problem becomes, find 
a curve (u(t), v(t)) so that 


and the integral 
J= | vee + Gv? dt = / Eu? + Gul dt 


is minimized. The Hamiltonian associated to the problem is 


H = —4/ Eu? + Gu2 + 1 uy + 2 ue 
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with co-state equations 


OH E,ui+G,u3 _ OH  E,ut+ Gyus 


ne Ou 2,/Eu? + Gu3 ee Ou 2.,/Eu? + Gu3 


and u-critical point equations for H, 


OM eon ee ey fj rae a Ga 
Our / Eu? + Gus : / Eu? + Gus > Bu 
These last equations say that ~, = Eu,/.Eu?+Gu% and v2 = 


Gu2/ /Eut + Gu2. Replacing y, and we by these quantities and u; and 
u2 by u and v in the co-state equations gives 


< ( Eu \_ E,ywW+G,0* 
dt\ JEW+tGw?) 2wWEW+Gi 
<( Go )- Ey tv? + G,v? 
dt\ JEwW+Gi?) w2wWEW+GR 


These are, of course, the “geodesic equations” arising in the proof of Theorem 
5.1. Hence, the optimal trajectories for this problem are geodesics. 


Remark 6.2. With respect to the metric on M, the amount of effort ex- 
pended to move from one point to another along an optimal path (ie. a ge- 
odesic) is precisely f lulz ¢ dt = f / Eu? + Gu2 dt = J. Therefore, geodesics 
may be considered to be paths along which a minimum amount of control 
is needed. Compare this with fuel optimal control problems in [Pin] and 
[Kir]. Also, note that we have directly obtained geodesics as locally deter- 
mined minimizers of arclength here. The controls u; and u2 (and thus the 
cost J = f \/Eu? + Gu dt) are local in the sense that we recreate the correct 
u(t) and u(t) from the state equations and cost, both of which rely on lo- 
cal control information. In a way, this is very different, philosophically, from 
trying to find a minimizer among all curves joining given endpoints — the 
underlying concept of the calculus of variations. 


EXERCISE 6.2. Show that an arclength minimizer between (1,2) and (2,3) in the 
plane is the straight line y = x + 1 by forming the optimal contro] problem 


L= uy y= u2 


Minimize J = Jv ue + ue dt 


and solving by the Pontryagin maximum principle. 
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EXERCISE 6.3. Show that a least area surface of revolution is a catenoid by forming 
the one dimensional optimal control problem 


y =u, Minimize J= fy V1+ 0? dz 


and solving by the Pontryagin maximum principle. Hint: derive the differential 
equation yy” — y/” —1=0 and let y’ = w so that y” = w (dw/dy). 

EXERCISE 6.4: Newton’s Aerodynamical Problem. Newton asked what the optimal 
shape was for a surface of revolution passing through, what he called, a rare medium. 
Here, the word optimal is taken to mean “offers the least resistance to movement 
through the medium”. Also, it should be noted that usual media such as water or 
the lower atmosphere do not qualify as rare while the stratosphere does. Newton’s 
problem may be cast in the language of optimal control (see [Tik], [ATF, p. 15] for 
a derivation). Solve 


y =u, u>0; y(0)=0, y(a) =, 


a, 
x 
=f iw dz, 


where y = y(x) represents the profile curve of the surface of revolution. Solve 
Newton’s problem by determining y using the Pontryagin maximum principle. Hint: 
after writing down the conditions of the principle, consider the minimum of the 
function 


to minimize cost 


w 
OTe oe 
What is the minimum for various w’s? MAPLE may be useful. Note u > 0. Show 
u = 0 is a local minimum at least. How do minima come about? After z is 
determined parametrically in terms of u, find y, using 


dy _dyde de 


du dodu ° du’ 


Remember to find the constant of integration. The answer to the problem may be 
found in Exercise 1.1.4. 


Since optimal control theory arose in the 1950’s in response to the problems 
of rocket and satellite trajectory control, it has revitalized the calculus of 
variations. We have presented only the barest hint of the power and beauty of 
this subject, but surely, within it hide many more applications to the geometry 
of surfaces. 
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8.7 THE CALCULUS OF VARIATIONS AND MAPLE 


In this section, I want to present some MAPLE procedures which are use- 
ful in finding and solving the Euler-Lagrange equations. Further, I’ll apply 
these to the problem of a marble rolling inside a (frictionless) hemispherical 
bowl under the influence of gravity and the result will be the plot of the mo- 
tion of the marble. This final stage will be a modification of the “plotgeo” 
procedure used to plot geodesics on surfaces. First, let’s take a look at the 
Euler-Lagrange equation in one variable. 


> EL1:=proc(f) 

local part1,part2,p2,dfdx,dfddx; 
part1:=subs({x=x(t),dx=diff(x(t),t)} diff(f,x)); 
dfdx:=diff(f,dx); 
dfddx:=subs({x=x(t),dx=dx(t) },dfdx); 
p2:=diff(dfddx,t); 
part2:=subs(dx(t)=diff(x(t),t),p2); 
RETURN(simplify(part1-part2=0)); 

end: 


VVVVVVVV 


Notice that we have to do a bit of substituting to ensure that the answer 
comes out in a form MAPLE recognizes as a differential equation. You might 
ask, why didn’t we input the function f in differential form? The answer is 
just that we become used to writing f(t,z,£) = 2? — 2? — 2zsin(t) for the 
integrand of a variational integral. Rather that write z as d(x(t))/dt (where 
we also have to tell MAPLE that z is a function of t), it seems easier to write 


> f:=x7 2-dx” 2-2«xxsin(t); 


f := 2? — dx* — 2zsin(t) 


and understand “dz” to mean x. Further, the procedure above, once we take 
the time to write it, takes care of making z a function of t and writing the 
final version in differential equations form. Here’s an example. 


> EL1(f); 


2x(t) — 2sin(t) + 2 (G0) =0 


dt? 
> dsolve({EL1(f)=0,x(0)=1,x(Pi/2)=2},x(t)); 


sin(t) 


re) cos(t) + 2sin(t) + cos(t) 


x(t) = — arctan ( 
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This is another instance where MAPLE produces output which is not in the 
best form. If you recall that 


sin(t) 


tan(t/2) = 1+ cos(t) 


then you will see that the true solution of the Euler-Lagrange equation is 
t 
a(t) = -3 cos(t) + 2sin(t) + cos(t). 


EXERCISE 7.1 Carry out the following MAPLE commands to extremize h(t, x, 2) 
= <7 /t? subject to the conditions z(1) = 2 and 2(2) = 17. 


> h:=dx" 2/3; 
> EL1(h); 
> dsolve({”=0,x(1)=2,x(2)=17},x(t)); 


EXERCISE 7.2 Extremize h(t, x, 2) = 2? +22 sin(t) subject to the conditions x(0) = 
0 and z(7) = 0. 


The following procedure handles the case where f does not depend on t. 


> EL2:=proc(f) 

local part1,part2,dfddx ; 

part1:=subs({x=x(t) ,dx=diff(x(t),t)},f); 
dfddx:=diff(f,dx); 

part2:=diff(x(t),t)*subs({x=x(t), dx=diff(x(t),t)},dfddx); 
RETURN(simplify(part1-part2=c)); 

end: 


VVVVVV 


For example, 


> g:=sqrt(1+dx~ 2)/sqrt(x); 


> EL2(g); 


1+ (da(t)) Ve) 


A small problem now arises because we have set the equation equal to c. 
MAPLE has no idea what c is, so if we now try to solve the differential 
equation, MAPLE doesn’t know what to do. There are two ways to get 
around this difficulty. The first is to use the fact that c is a constant to form 
a new differential equation obtained by differentiating with respect to ¢ and 
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thus annihilating c. To do this, we must peel off the left side of the output of 
EL2(g). This can be accomplished by noting that the output is made up of 
two operands, the left side (operand 1) and the right side (operand 2). The 
command “op(1,EL2(g))” takes the left side. Hence, 


> dsolve(diff(op(1,EL2(g)),t)=0,x(t)){1]; 


| C1 C1 
t=-,/-1+ ne) — Clarctan ( —1+ &) —C2 


I should point out that MAPLE provides an easier way to take left or right 
sides of equations. Namely, the commands “lhs” and “rhs” may be invoked. I 
have chosen to use the “op” command above because it comes in handy when 
many operands are involved and one of them is to be isolated. Therefore, I 
think it should be seen at least once. Also, notice that there is a [1] attached to 
the end of the command. This is because MAPLE provides a set of solutions 
for the differential equation and I only wanted the first. Try the command 
without the [1] to see what I mean. The second way to solve the equation is 
to get rid of the c by taking the left side and then put it back in explicitly as 
follows. Try this. 


> dsolve(op(1,EL2(g))=C,x(t))[1]; 


I don’t happen to like this way because if you want to do anything else with 
this result, MAPLE still has trouble knowing what C is. 


EXERCISE 7.3 Extremize k(t, x, +) = <*/x° subject to the conditions (0) = 1 and 
z(2) = 4. Do the problem by hand and then carry out the following MAPLE com- 
mands. This will give you an idea of some pitfalls involved in the use of computers 
to symbolically solve diffential equations. (By the way, when you see a double quote 
“? in a command, this means that MAPLE will take the previous output as input.) 


> k:=dx72/x73; 

> EL2(k); 

> dsolve(op(1,EL2(k))=C,x(t)); 

> dsolve({diff(op(1,EL2(k)),t)=0,x(0)=1,x(2)=4},x(t))[1]; 
> dsolve( {diff(op(1,EL2(k)),t)=0,x(0)=1,x(2)=4},x(t)) [2]; 
> dsolve(op(1,EL2(k))=C,x(t)) [1]; 

> dsolve(op(1,EL2(k))=C,x(t)); 

> subs(x(t)=x,”); 

> ss:=solve(” ,x); 

> eql:=subs(t=0,ss)=1; 

> eq2:=subs(t=2,ss)=4; 

> sols:=solve({eql,eq2}); 

> simplify(subs(sols[1],ss)); 

> simplify(subs(sols[2],ss) ); 
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The following example shows how isoperimetric problems may be handled. 
The problem is to minimize J = [ #? dt with x(0) = 2, z(1) = 4 and subject 
to the constraint f xdt = 1. 


> k1:=dx~2-lambda«x; 


k1 := dx? — rx 
> EL1(k1); 


2 
—A +2 (<aet)) = 0 
> dsolve({EL1(k1),x(0)=2,x(1)=4},x(t)); 


a(t) = ~4x? +2+ (Fa + 2) t 
> op(2,dsolve({EL1(k1),x(0)=2,x(1)=4},x(t))); 
1 1 
—pe +24 (3 + 2) t 


> int(” ,t=0..1); 


> solve(” =1,lambda); 


—48 
> subs(lambda=” ,dsolve({EL1(k1),x(0)=2,x(1)=4},x(t))); 


a(t) = 12t? + 2— 10t 


Now let’s look at the two variable case. In fact, I should say cases since we 
may want to choose different forms of the Euler-Lagrange equations depending 
on whether the integrand is x or y independent. The following procedures deal 
with lagrangians having 2 t-dependent functions z and y and their derivatives 
xz and y. The first case is when both Euler-Lagrange equations should be 
computed. 


> ELsys1:=proc(f) 

> local ell,el2,part1,part2,part3,part4,p2,p4,dfddx,dfddy; 

> part1:=subs({x=x(t), y=y(t),dx=diff(x(t),t) ,dy=diff(y(t),t)}, 
diff(f,x)); 

> dfddx:=subs({x=x(t),dx=dx(t),y=y(t),dy=dy(t)} diff(f,dx)); 

> p2:=diff(dfddx,t); 
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part2:=subs({dx(t)=diff(x(t) ,t) dy (t)=diff(y(t),t)},p2); 
ell:=simplify(part1-part2=0); 


part3:=subs({x=x(t),y=y(t) ,dx=diff(x(t),t),dy=diff(y(t),t)}, 
diff(f,y)); 

dfddy:=subs({x=x(t) ,dx=dx(t) ,y=y(t),dy=dy(t) } ,diff(f,dy)); 

p4:=diff(dfddy,t); 

part4:=subs({dx(t)=diff(x(t),t) dy (t)=diff(y(t) ,t)},p4); 

el2:=simplify (part3-part4=0); 

RETURN(ell1,el2); 

end: 


The third case is when the second Euler-Lagrange equation should be com- 
puted, but the lagrangian is independent of z. 


> ELsys3:=proc(f) 

> local ell ,el2,part3,part4,p2,p4,dfddy; 

> p2:=simplify(subs({x=x(t),dx=dx(t),y=y(t),dy=dy(t)}, 
diff(f,dx))=c); 

> ell:=subs({dx(t)=diff(x(t),t) dy (t)=diff(y(t),t)},p2); 


> part3:=subs({x=x(t),y=y(t),dx=diff(x(t),t),dy=diff(y(t),t)}, 
diff(Ey)); 

dfddy:=subs({x=x(t),dx=dx(t),y=y(t),dy=dy(t)} ,diff(f,dy)); 

p4:=diff(dfddy,t); 

part4:=subs({dx(t)=diff(x(t),t),dy(t)=diff(y(t),t)},p4); 

el2:=simplify(part3-part4=0); 

RETURN(el1l,el2); 

end: 


VVVVVV 


EXERCISE 7.4. Write MAPLE procedures which handle the second and fourth cases 
of the Euler-Lagrange equations for two variables. Namely, handle the cases when 
the integrand f is independent of y and dependent on z and when it is independent 
of both variables. 


We might also wish to return a portion of an equation unevaluated. For ex- 
ample, we may want d/dt unevaluated. In this case, we can replace “diff(-,t)” 
with “Diff(-,t)” in the expressions for “dfddx” and “dfddy” above. Now let’s 
apply the Euler-Lagrange procedure to the situation of a marble rolling in a 
hemispherical bowl. Recall that the action integral is 


J= [pin FP cos? vi? + 0”) — mgRsinv dt. 
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where the bowl has radius R, the marble has mass m and g is the gravitational 
constant of the Earth. The Euler-Lagrange equations for this two variable 
problem are 


Cc g 


, * ; .2 
“a= ————— and ¥+—cosv+sinvcosvu’ = 0. 
mR? cos? v R 


These are then the equations of motion of the marble. In Figure 8.4 the 
trajectory corresponding to these equations with g = m = R = 1 is shown. 
Now let’s see how this trajectory was obtained by using MAPLE. 


> with(plots): 
The following is a parametrization for the sphere (R = 1). 
> sph:=(cos(u) «cos(v),sin(u)*cos(v),sin(v)]; 


The following series of procedures calculates the Euler-Lagrange equations 
for the action integral constrained to the surface of the sphere. Of course, 
since we need the metric, we must also include the usual procedures necessary 
for its calculation. 


> dp := proc(X,Y) X{1]*Y[1]+X[2]*«Y[2]+X(3]*«Y(3] end: 


> 

> nrm := proc(X) sqrt(dp(X,X)) end: 
> 

> xp := proc(X,Y) 


local a,b,c; 

a := X[2]«Y[3]-X[3]*Y([2]; 
b := X[3]*Y(1]-X[1]*Y([3}; 
ce := X(1)*Y[2]-X[2]*Y[1); 
eee 


Jacf := proc(X) 
local Xu,Xv; 
Xu := [diff(X[1],u),diff(X[2],u) diff(X[3],u)]; 
Xv := [diff(X[1],v) ,diff(X[2],v) diff(X[3],v)]; 
atch [Xu,Xv]); 
EFG := proc(X) 
local E,F,G,Y; 
Y := Jacf(X); 
E := dp(¥[1], Y(1)); 
F := dp(¥[1],¥[2)); 
G = dp(Y2},Y12)); 
es 


VVVVVVVVVVVV VV VV VV VV 
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The following procedure gives the Euler-Lagrange equations for the action 
integral. By Hamilton’s principle, these equations must be the equations of 
motion of the associated physical system. Notice that we use ELsys3 for 
the Euler-Lagrange equations since the constrained integrand T — V has no 
dependence on u. In other situations, other cases of the ELsys procedures 
may be necessary. 


> Equat:=proc(X) 

local Metric, TT, VV,LL,Eul,eq] ,eq2; 
Metric:=subs({u=x,v=y},EFG(X)); 
TT:=1/2*(Metric(1]*xdx~ 2+ Metric[3]*dy~2); 
VV:=subs({u=x,v=y},X[3]); 
LL:=TT-VV; 

Eul:=ELsys3(LL); 
eq1:=subs({x(t)=u(t),y(t)=v(t)},Eul([1]); 
eq2:=subs({x(t)=u(t),y(t)=v(t)},Eul[2]); 
RETURN(eq]1,eq2); 

end: 


VVVVVVVV VV 


For example, the equations of motion for the sphere-marble problem are ob- 
tained by 


> Equat(sph); 


cos(v(t))? (Sut) =c¢ 


~ cos((t)) (Zuo) sinoen - cos(u(t)) = (aut) =o 


Here c is a constant dependent on the initial speed at which we propel the 
marble. Different c’s should be expected to produce different motions. The 
next procedure allows us to plug the numerical solution of the equations of 
motion into the definition of a surface X and plot both the surface and the 
motion. This procedure is almost identical to “plotgeo” which allowed us to 
plot geodesics on surfaces. The only real differences are that we must be able 
to plug in a value for c (and this necessitates some finagling with the definition 
of “sys” and the way we provide a set of equations for “desys”) and that the 
u-equation is first order. Hence, for “plotmotion,” there is no “Du0” input. 
Again, different problems may require modification here. 
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> plotmotion:=proc(X,const,ustart,uend,vstart,vend,u0,v0,Dv0,n,gr, 
theta,phi) 
> local sys,desys,dequ,deqv,listp,j,motion,plotX; 
> sys:=[subs(c=const,Equat(X)([1]),Equat(X)[2]]; 
> desys:=dsolve({sys(1],sys[2],u(0)=u0,v(0)=v0,D(v)(0)=Dv0}, 
{u(t),v(t)},type=numeric,output=listprocedure); 


> dequ:=subs(desys,u(t)); deqv:=subs(desys,v(t)); 

> listp:=[seq(subs({u=dequ(j/n{3]),v=deqv(j/n[3])},X), j=n[1]..n[2])]; 

> motion:=spacecurve({listp}, color=black,thickness=2): 

> plotX:=plot3d(X,u=ustart..uend,v=vstart..vend,grid=([gr[1],gr[2]]): 

> display ({motion,plotX},style=wireframe,scaling=constrained, 
orientation=[theta, phi]); 

> end: 


Again, just as in “plotgeo,” note that the inputs “n” and “gr” are vectors. 
Figure 8.4 comes from the command 


> plotmotion(sph,1,0,2*Pi,-Pi/2,0,0,-Pi/4,0,[0,200,20],[30,20],0,60); 


EXERCISE 7.5. Carry out the following MAPLE command and explain the result 
in terms of the chosen c and the equations of motion (for this c!). 


> plotmotion(sph,1/(sqrt(2)*2~ (1/4)),0,2*Pi,-Pi/2,0,0,-Pi/4,0, [0,150,20], 
[30,20],0,60); 


EXERCISE 7.6. Now do Exercise 4.8. Namely, find the equations of motion of a 
marble in a paraboloid shaped bow! and plot the motion for varying c. Try 


> par:=[v*cos(u),v*sin(u),v~ 2]; 
> plotmotion(par,1,0,2*Pi,0,2.5,0,2,0,[0,500,20],[30,12],0,60); 


Note that the parametrization is chosen so that “plotmotion” can be used without 


modification. 


Finally, let’s see how MAPLE may be used in optimal control. Let’s take 
one standard problem and see how MAPLE procedures may allow for calcu- 
lation and visualization of solutions. Here’s the problem. Suppose we are on 
a block of ice on an ice covered (so frictionless) pond and we want to get to 
the center of the pond as quickly as possible. It so happens that our ice block 
has a rocket motor attached to it which can be fired forward or backwards. 
The question then is, what is the optimal way to fire the rocket so that our 
ice block ends up at the center of the pond in the least time? 
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Newton’s Law F = ma tells us a fundamental relation between our con- 
trol — the firing of the rocket — and the acceleration of the ice block. Instead 
of dealing with this second order equation, we can use a standard trick to 
transform the problem into a system of first order equations. Namely, let x1 
denote the distance coordinate (from our position to the pond’s center) and 
x2 = £1 denote x1’s velocity. Then, with |F'| = u the control, Newton’s Law 
becomes 

ch 72 and T2 = Ui 


Since we are trying to minimize time, the integral to be minimized is J = 
1dt = T. Also, and very importantly, our rocket has limitations on the 
thrust it can give. Let’s say that 


|F| = |ul <1. 


Now, the Hamiltonian is easy to write down, but let’s write a MAPLE proce- 
dure to do this and (if possible) solve the co-state equations. Indeed, we can 
solve the whole problem in a conceptual and visual manner by making use of 
MAPLE’s graphics capabilities. First, we need the following packages. 


> with(DEtools): with(plots): with(linalg): 


> Hamilton:=proc(fl,f2) 

local Ham,part1,part2,psil ,psi2: 
Ham:=collect(-1+psil(t)+*fl + psi2(t)*f2,u); 
part1:=-diff(psil (t) ,t)=diff(Ham,x1); 
part2:=-diff(psi2(t),t)=diff(Ham,x2); 
RETURN(H=Ham,dsolve({part1,part2},{psil(t),psi2(t)})); 
end: 


StateEq:=proc(f1,f2,U) 
local part1,part2: 
part1:=diff(x1(t),t)=subs({xl=x1(t),x2=x2(t),u=U},f1); 
part2:=diff(x2(t),t)=subs({xl=x1(t),x2=x2(t),u=U},f2); 
RETURN (factor(dsolve({part1,part2},{x1(t),x2(t)},laplace))); 
end: 


VVVVVV VVVV VV 


By the way, note that I solved the system of differential equations above 
using “dsolve(...,laplace),” which means that the explicit solution is obtained 
by laplace transform methods. You should look up the help on “dsolve” to 
see all the options available for this command. Remember, to access MAPLE 
help, type “?dsolve.” Now, let’s write the right sides of the state equations 
and find the Hamiltonian. 


Sf bex2- Zu: 


fl = a2 
{2% 
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> Hamilton(f1,f2); 
H =-1+yW1(t)x2 + p2(t)u, {W1(t) = —Cl, ~2(t) = C1 + C2t} 


The Pontryagin Maximum Principle tells us that the Hamiltonian along an 
optimal trajectory must always be a maximum with respect to u. But H is 
linear in u, so the maximum is attained at the endpoints; that is, when u = 1 
and u = —1. Before we use this information to plot solution trajectories, we 
can actually solve the state equations explicitly. (This is very rare for optimal 
control problems and only occurs here because this is a rather simple system.) 


> StateEq(f1,f2,1); 
{220 = 2(0) + t,01(t) = 01(0) + ©2(0)t + st} 
> StateEq(fl,f2,-1); 


{220 = £2(0) —t,x1(t) = 21(0) + ©2(0)t — se} 


Now we can plot the trajectories for u = 1 and u = —1 separately and then 
try to discern a combination of trajectories which get us to the origin quick- 
est. Trajectory plots may be accomplished through MAPLE’s “phaseportrait” 
command. The general format includes inputs which are the f1 and f2 of the 
state equations ([x2,1] below), the variables x1 and x2, the trajectory parame- 
ter (-4..4 below) and a list of points from which the trajectories begin. A point 
is given by three coordinates. The first coordinate gives a starting value for 
the trajectory parameter while the second and third coordinates are the usual 
cartesian coordinates of the point in the plane. For example, [0,2,0] starts the 
trajectory parameter at t = 0 from the plane point (2,0). Note that we give 
the plot structure a name and save it to be displayed later. (Remember the 
colon.) 


> phl:=phaseportrait([x2,1],[x1,x2],-4..4,{[0,0,0],0,2,0], [0,4,0],[0,6,0], 
(0,-2,0],{0,-4,0],[0,-5,0],[0,-1,0],[0,1,0],[0;0,0]}, thicknéss=2): 


Also, it is possible to graph the direction vector field corresponding to the 
system of differential equations. 


> field1:=dfieldplot([x2,1],[x1,x2],-3..3,x1=-5..9,x2=-5..5): 
Both of these pictures may now be displayed in a given viewing region. 


> display({ph1,field1}, view=[-5..9,-5..5],scaling=constrained); 


310 8. The Calculus of Variations and Geometry 


le RN Na at pane ame aes Paar ape Speer parame gn ona ae ee armen «sages Se etn oy ie 


ee = 


~~  S 
Se 


Cnn et nt en nent ee eee ee ee ee ee 


FIGURE 8.6. ‘Trajectories for u= —1 


The following commands give the u = —1 trajectories. 

> ph2:=phaseportrait((x2,-1],[x1,x2],-4..4,{[0,0,0],{0,2,0},[0,4,0], [0,5,0], 
(0,-2,0],{0,-4,0],[0,-1,0], {0,-6,0],[0,1,0],(0,0,0]}, thickness=2): 

> field2:=dfieldplot([x2,-1],[x1,x2],-3..3,x1=-9..5,x2=-5..5): 

> display ({ph2,field2},view=|-9..5,-5..5],scaling=constrained); 

> display({ph1,ph2,field1 ,field2} , view=[-9..9,-5..5], scaling=constrained); 


But now we can see the trajectories which get us to the origin and we can plot 
them alone as follows. We also include a trajectory starting from a point in 
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FIGURE 8.7. ‘Trajectories for u= 1 and u = —-1 


the plane to show what the solution to the original problem is. Namely, from 
any point in the plane, take the trajectory (whether 1 or —1) which travels 
toward the trajectory leading straight to the origin. Specifically, the picture 
below shows a starting point (2,2), so that we take a u = —1 trajectory until 
it intersects the u = 1 trajectory going to the origin. We then switch to the 
latter and go right to (0,0). In fact, for this problem, the switch time (and 
total time) may be calculated precisely. In general, however, it is the picture 
which provides the qualitative solution. For this, MAPLE is well suited. 


> ph01:=phaseportrait([x2,1],[x1,x2],-3..0,{[0,0,0]},thickness=2): 
> ph02:=phaseportrait([x2,-1],[x1,x2],-3..0,{[0,0,0]} ,thickness=2): 
> ph11:=phaseportrait([{x2,-1],[x1,x2],0..4,{[0,2,2]} thickness=2): 
> display({ph01,ph02,ph11},scaling=constrained ,view=(-4..4,-3..3]); 


FIGURE 8.8. The final solution path 


Chapter 9 


A GLIMPSE AT 
HIGHER DIMENSIONS 


9.1 INTRODUCTION 


Up to this point, we have only considered surfaces x(u,v) dependent on 
two parameters. In mathematics and the sciences, however, it is often the case 
that geometric structures depend on many parameters. Indeed, the number 
of degrees of freedom of a physical system tells us precisely the number of pa- 
rameters necessary to describe the so-called configuration space of the system. 
Just as we could use differential geometry to understand particles moving on 
constraint surfaces, so we would like to do the same for systems with many 
parameters. This means that we must invent a notion of higher dimensional 
surface which mimics the properties of the geometry we are comfortable with, 
namely the geometry of two-dimensional surfaces. In this chapter, we will 
consider these higher dimensional surfaces from a naive point of view with 
the goal of introducing the relevant notation and making the analogy with 
the two-dimensional situation. In this sense, this chapter is simply a dictio- 
nary for readers making the transition from the geometry of two dimensions 
to that of n dimensions. In particular, we will not introduce manifolds and 
their covariant derivatives in their most abstract generality, but instead stick 
(mostly) to the case of submanifolds of Euclidean space with the induced 
metric and covariant derivative. In this way, we hope the reader can be in- 
troduced to the geometry of higher dimensions while still maintaining some 
touch with reality. Basic references for this chapter are [DoC2], [Spi], [GHL], 
[Hi] and (for connections with physics) [CM]. 


9.2 MANIFOLDS 


Just as for two dimensions, we may define a k-patch, or coordinate chart, 
x: D — R"+! for a connected open set D C R*. Here, we assume that 


x(ul,...,u*) = a eee SG) 5 cece Ab eae ,ur)) 


is smooth (i.e. all partials of all orders exist and are continuous), injective 
(with continuous inverse on its image) and the tangent vectors of parameter 
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ax 2 (SS ial ) 


are linearly independent (i = 1,... ,&). We shall usually denote these tangent 
vectors, when the patch is understood, by 0; alone. A subset M kK CR+! ig 
a k-manifold if it is covered by k-patches with the property that, for any two 
patches x: D, — R"+! and y: Dy — R"*! with nonempty intersection, the 
composition transition map 


xt oy: y7! (x(Dx) Ny(Dy)) > Dx 


curves 


is smooth. Of course, when we say “covered by k-patches,” we simply mean 
that every point of M lies in the image of some patch. Also, the k-dimensional 
tangent space to M at p € M is simply the subspace of R"*! spanned by the 
linearly independent vectors 0;(p),... ,0,(p). The tangent space is denoted 
by T,M just as for surfaces. 


Example 2.1: The sphere S”. The n-sphere in R"*?! is defined as 
gr 2 f(at,... et )[(et)? +... + (ett)? = 1}. 


This is the direct analogue of S?, the 2-sphere in R?. Note that we call the 
sphere in R”+! the n-sphere instead of the n + 1-sphere. The reason for this 
terminology is that the sphere in R"*! is an n-manifold, not an n+1-manifold. 
So, the appellation “n-sphere” denotes the dimension of the manifold itself, 
not the dimension of the ambient, or surrounding, space. Let us now show 
that S” is a manifold of dimension n. We must cover S” by patches which 
satisfy the “transition property” above. 

Now, stereographic projection St: S" > R” works in R"*+1 just as in R°. 
Namely, denoting the North and South poles by V = (0,... ,0,1) and S = 
(0,... ,0,—1) respectively, we have the North and South projections 


St (a? ot!) = ee ated 
AE ae § =| eee ep 


1 n+1 t : x” 
Sista se ge = (hae) 

Let x(u!,...,u”) = St7}(ul,... ,u™): R® 4 S"—{N} and y(w’,... ,w”) 
= St7?(w!,... ,w”): R® — $” — {S} denote the respective inverses. Both x 
and y are smooth maps which are injective with smooth inverses (i.e. St) on 
their images. Also, it is easy to see that the parameter tangent vectors are 
linearly independent, so x and y are patches. Clearly, x and y cover S”, so 
we must only show the transition property to see that S” is an n-manifold. 
For this, note that x and y have the explicit representations 
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SE ace cid fc we, 
— Sui? 41? Sul 41) Sul? 41 


2w} 2u™ 1 -Sowi? 
14+ wi? 14 Dwi?’ 14 Dw? 


The transition function is then given by 


y(w',...,w”) = ( 


R" — {(0,... ,0)} ~A> 9" — {NS} St, RO 


and calculated to be 


1 n 
= es Uu U 
y Fox(ah t= ( rar an) G0 a) 


Therefore, each component has the formula 


a 


and, since the origin of R” is precluded from the domain of the transition 
map, each component function is smooth. Thus, $” is an n-manifold. 


EXERCISE 2.1. Verify the formulas above for stereographic projection, x, y and 
the transition yo ox and explain why the origin of R” is not in the domain of the 


transition. 


Now suppose that x: D, > R"*+1 and y: Dy — R"*! are two patches with 
nonempty intersection x(D,) N y(Dy) # @. Then, on the intersection, 


x(u!,...,u")=y(w',... ,w*) 
and y-'ox(u',... ,u*) = (w,... ,w*). Since each component function y® 
may be considered as a function of the u‘’s, we have by the chain rule, 
Oy® — Ox*® 
Out Ou 
_ 3 Ow) dys 
— St But Ows 
j=l 


This type of transformation rule is familiar to physicists who are interested 
in relating the physical quantities of one reference frame to those of another. 
Indeed, for this reason and because of the coordinate change formula above, 
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physicists in the old days often defined the mathematical objects they were 
interested in (i.e. “tensors” ) in terms of such transformation rules. If we write 
out the coordinate form of the formula above, we obtain 


($< | _ dw* (34 ee 


aut’? Aut J But \ Ow! Ow! 


4 Out (Buh Byh™™ 
"Out \Owk?"? Awk J - 
Written more compactly, we have 
k ; 
Ow) 
B= 2 Bye AY 
j=l 


and this formula shows that the coordinate change rule allows us to change 
the basis of the tangent space T,M via the matrix 


dwt, Owh wt 
Out Out Ouk 
Ow! 
Out 
Owe awk ~ aw* 
dul Out Our 


which we denote by J(u, w) since it is just the Jacobian matrix of multivariable 
calculus associated to a coordinate change. This formulation allows us to 
make the notion of orientability (see Chapter 2) precise. Say that a manifold 
M is orientable if there is a collection of patches {xy}ae.4 which cover M 
such that the Jacobian matrices for all possible transitions xz1 o xg have 
positive determinant (evaluated at all points in the overlap). Intuitively, we 
can think of this requirement as the analogue of saying that a rotation keeps 
an object oriented the same way while a reflection produces an oppositely 
oriented mirror image. One thing to notice is the following 


Proposition 2.1. Suppose that the intersection of two patches is path con- 
nected. In particular, suppose x(Dx) NM y(Dy) #@. Then the determinant of 
the associated Jacobian does not change sign when evaluated at any point in 
the intersection. 


Proof. We will give a rather sophisticated (as opposed to computational) proof 
here because the same ideas occur often in the geometry of higher dimensions. 
Choose two arbitrary points p and q in the intersection x(Dx) M y(Dy) and 
take a path a: I > x(Dx) N y(Dy) C M with a(0) = p and a(1) = qg. Now, 
just as in Lemma 2.1.1, we may write 


x(ul(t),... ,u*(t)) = a(t) = y(wi(t),... ,w*(t)) 
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with y~1ox(ul(t),... ,u*(t)) = (wi(t),... , w*(t)). In this way the Jacobian 


Huw) = (Fe ®) 


is dependent on t as well and the determinant becomes a continuous func- 
tion det J(u, w): I — R (since the partials are continuous). Now, J(u,w) € 
GL(k, R), the group of invertible k x k-matrices over R, so det J(u, w)(t) #0 
for all t. Therefore, by continuity (and the intermediate value theorem), the 
continuous function det J(u, w) cannot take both positive and negative values. 

O 


Corollary 2.2. Ifa manifold M is covered by two patches whose intersection 
is path connected, then M is orientable. 


Proof. We must only show that one single J(u, w) has positive determinant. 
Suppose it does not. Then, by either changing the sign of one coordinate 
function or interchanging two coordinate functions in one of the patches, we 
change the sign of the determinant of J(u,w). This follows from the usual 
properties of determinants; namely, if a row (or column) is multiplied by —1 or 
if two rows (or columns) are interchanged, then the sign of the determinant is 
changed. Note that the connectedness of the intersection and the proposition 
above ensure that we must only check the positivity of det J(u, w) at a single 
point. O 


Example 2.2: The sphere S”. For the patches x and y of Example 2.1, 
the transition function was calculated to be 


1 


u u” 
wu a) = eee sw), 


The Jacobian matrix may be calculated by first observing that 


YORU pclae = ( 


dw) — (So, ue?) 6) — Quiui 
dui (Ss. us?)2 
where 6/ = 0 ifi#j and 6] =1ifi=j. (That is, 6 is the Kronecker delta.) 


By Proposition 2.1, we need test the Jacobian at a single point, so choose 
(1,0,... ,0). Then, we have 


29, iG: ee, sf 
Gi Stet: 76 
J(U,0)(1,0,..0) =) 2, 
O90) gee FE 


This matrix has determinant —1, so the given patches do not orient S”. 
However, by Corollary 2.2, since the patch intersection is the path connected 


9.3 The Covariant Derivative 317 


set S” — {N,S}, S” must be orientable. Indeed, by the proof of Corollary 
2.2, we can redefine y to be a new patch 


ae k 2w 2w 2w 
y(w,...,w') = Soap ee ED Be en OO See: ew ae AOD 
1+>>w* 14+>5u* 14+ >> 


2w” 1 Es) 


Tp ae 


2 1 


obtained by switching the first two coordinate functions. The Jacobian matrix 


at (1,0,... ,0) now has its first two rows switched, 
O. tS 
a9: . SOP! xk 2G 
DUGG EO | on cays, Gen Aes 
O° °O.. aes. 32 


and det J(u, v)(1,0,...,0) = 1. The patches x and y therefore orient S”. 


9.3 THE COVARIANT DERIVATIVE 


From now on we will confine ourselves to the image of a single patchx: D C 
R* — R”+! on a k-manifold M*. Hence, from now on, we are doing local 
differential geometry. From Proposition 2.1, we see that x is automatically 
orientable since D, and hence its image x(D) are connected. For the patch x 
(which we may also refer to as M), the parameter tangent vectors 01,... , Ox 
form a basis at each point for the tangent space T,M. We may extend this 
basis for T,M to a basis for R"*+? by choosing vectors, Uj,... ,Un4i-z in 
R”++ such that 


U,- 0; =0 for all s and 7 


and 


U0 = 6 where 6 is the Kronecker delta. 


The U;,’s are then normal vectors for M. If k = n, then only one normal vector 
U exists and, in this case, M is said to be a hypersurface. All surfaces in R® 
are thus hypersurfaces. Recall that a smooth vector field V on M is simply 
the assignment of an R”+! vector to each point of M so that V: M — R”+} 
is smooth. If the assigned vectors are always tangent to M, then V is said 
to be a tangent vector field. The vector fields 0,,... ,@, are tangent vector 
fields on M* while the vector fields Uj,... »Un41-k are normal vector fields. 
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For a vector field Z = (Z!,... ,Z"*+1) in R”*+!, the R"+1-covariant deriv- 
ative is defined just as in the two dimensional situation. Namely, 


n+1 
VEZ =) viz Je 


i=1 


where e; is the standard i‘® basis vector for R”+1 and v[-] denotes the usual 
R”+! directional derivative v[f] = Vf+v. The covariant derivative for M is 


then defined to be the orthogonal projection of VR" onto 15M, 


" n+1— 
VvZ = pro) T,MVe Z 


where v € T;,,M, Z is defined on M and Z is a local extension of Z to an open 
set in R”+} containing M. Similarly, for a tangent vector field V, we may 


define Vy Z = proj 1,MVE OZ by 
Vv 4(p) = proj 1,MV EG) Z, 


where V is a local extension of V as well. The following exercise shows that 
VvZ is well defined. 


EXERCISE 3.1. Show that the definition of Vy Z does not depend on the extensions 
n+1 — n+1 — 

chosen for V and Z. Hints: (1) start by showing that ve “Ze vt, ™Z on M for 

Vi and V2 extensions of V. Use the fact that Vi = V2 on M. (2) Now show that 


ve 2, = ve Zo on M as well. 


From now on, for notational convenience, we shall dispense with distin- 
guishing vector fields on M from their local extensions on open sets in R”*?. 
It is important to realize that the covariant derivative on M is simply the 
R"+?! covariant derivative minus its Uj-components,i=1,...,n+1—k. 


Proposition 3.1. Let V, Z and W be tangent vector fields on M and 
f: M —R bea function on M. The following are properties of the covariant 
derivative of M: 
(i) Vv(Z+ W) =VvZ+VVW 
(ii) VevZ = fVvZ 
(iii) Vy fZ = Vif] Z+f[VvZ 


(iv) V[(Z,W)] = (Vv Z,W)+(Z, Vv W), where (-,-) denotes the usual dot 
product in R”++ restricted to vectors tangent to M. 


(v) VvZ—VzV =[V, Z], where |-,-] is defined below. 
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Proof. To prove (i), it suffices to note that directional derivative and projec- 
tion both preserve sums. For (ii), note that, by definition, 


VEY Z=S"fV[Zie =f WV [Ze = f VEZ. 
Then we take projections to obtain 


- n+1 
VevZ = proj TMV yy Z 
a n+1 
=projr,m(fVy Z) 
. n+1 
=fprojrmVy Z 
since f(p) simply multiplies vectors by a scalar and, so, doesn’t affect projec- 


tion. Then, by definition, 
VevZ=fVvZ. 


For (iv), let us compute using the definitions of (-,-) (i.e. dot product) and 
ver 


V((Z,W)] =VIS_ ZW] by the definition of (.,-) 
a 
=SOV[Z"]Wi + Z°V(W4) by the Leibniz rule 


=(VE"Z,W)+(Z,VE" W) by the definition of VR” 
= (VvZ, W) ae (Z, VvW) 


because, since Z and W are tangent vector fields to M, all dot products with 
normal vectors (U;,W) and (Z,U;) are zero. Since VR" and V differ only by 
their normal components, the equality of the last two lines of the calculation 
above follows. Part (iii) is left to the reader in the exercise below and part 
(v) will be proved below after we discuss the bracket [-,-]. 0 


EXERCISE 3.2. Prove the equality Vy fZ = V[f]Z+ f VyZ in (iii) above. Hint: 
use (ii) as a guide. 


Now let us discuss the Lie bracket of vector fields. If V and W are tangent 
vector fields on M, then we define 


[V, W][f] = VWF] - WIV]. 
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Here we are saying that [V,W] acts on a function f as a vector field should. 
We will show that this actually defines a tangent vector field on M by writing 
[V, W] as a linear combination of the 8;. Also, to avoid a surfeit of square 
brackets, we will abuse notation a bit and write [V,W] = VW[f] — WV[f]. 
First, write V = >>, v'0; and W = >> j w/8; and then compute 
WV[f]=W) OLE] 
= OW Olf] +o Wilf 


= Vwiaia/) + v'wa,atf) 


=e ; Ov' of a, O° f 
” Bul Out ow Ou'dus 


and similarly, 


so that 


a 


UaNe MAY IE E(x Sao 3) anf 


In other words, we have 


= 19. a j eT) 
[V,W] = 2 0; where a= dX € Rah aad 
and the bracket [V, W] is then a tangent vector field on M. 


EXERCISE 3.3. Apply the definition of bracket to prove the following properties. 
(1) [V,W] = -[W,V] 
(2) [aV + bW, Z] = alV, Z| + b[W, Z] where a,b ER. 
(3) [[V, W], Z] + [[W, Z], V] + [[Z, V], W] = 0 (Jacobi Identity) 
(4) [fV,gW] = fol[V, W]+ fV[g]W — gW[f]V for smooth functions f and g. 
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Now let us concentrate on proving (v) of Proposition 3.1. First, we will 
relate the covariant derivative and the bracket by a general formula which will 
eventually reduce to (v). To begin, note that [0;,0;] = 0 for all i,j. This 
follows because 0;[f] = Of /Ou’, so that 

fag Bg 0? f 
i, 0; = 0; ; — 0,0; = ; -—-——— =0 
[0 5] [f] 0; [f] 50. [f] Audus dus dui 
since mixed partials are equal. Now, for V = )>, v'A; and W = >> j w9;, we 
have 


VvyW =) v'Vo,(S_ w9;) 
a J 
= S- S (vi djw'd; + viw'Vo, 0;) 
| 
after switching 7 and 7. Similarly, 


VwV = >_>-(wId;v'd; + wv'Vo, 3). 


Jj 


Then we have 


VvW-VwV =), D (FS - v5) 0; 
J 


To prove (v), we will show that the second term in this formula vanishes. 


Lemma 3.2. Va,9; — Va, 0; = 0. 


Proof. From the general formula above applied to the R"++ covariant deriva- 
tive VR""" with 8; = 57, ae, and 0; = 37, be), we obtain 


Rt! R"+1 


Vo, 9-V5, 3 =(8:,0;)+ > (abt — ba®) VR" e; = 040 = 0 
k,l 


since [0;,0;] = 0 by the discussion above and Ver ei = 0 by definition of 
vR"" Then, since VE; = VE ai, the projections Vo,0; and Vo, 90; 
must be equal as well. O 
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Proof of Proposition 3.1 (v). Consider the general formula 


VyW -VwV =(V,W]+ 50S) (viv! - w'0') Va, 0; 
tj 


and focus on the last term >; >, (v’w* — wv') Va,0;. In this term, the 
factor in parentheses is antisymmetric in 7 and 7 while, by Lemma 3.2, the 
factor Va, 0; is symmetric ini and 7. Hence, summing over all 7 and 7 produces 
zero since a term and its negative always appear. Therefore, 


VVW -VwV =(V,W]. 
O 


Suppose M” C R”*! is a hypersurface, so that there is only one (outward) 
unit normal vector field U on M. For tangent vector fields V and Z, we may 
write 


VEZ = VW ZH(VE ZU)U 


since Vy Z is the tangential projection of ven Z. Again note that V and Z 
must be extended to an open set in R”+ to make sense of VR""’ Z. 


EXERCISE 3.4. Show that the coefficient of the normal component above is 
n+1 
(ve Z,U). 


We may write the equation above as 


Vy = VEZ (VEZ U)U 


Rrt1 Rt! 


Sy Aa UN 


since, by (iv), V(Z,U) = (VB""'Z,U) + (Z, VE" U) and (Z,U) = 0. Then 
by symmetry of (-,-), 


VWyZ = VP"Z -(S(V), Z)U 


where S(V) = Ve u is the shape operator of M in R"*1. The properties 
of Proposition 3.1 (especially (iv) and (v)) may be used to prove the following 
generalization of Theorem 2.3.1. 
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Theorem 3.3. The shape operator is a symmetric linear transformation on 
the tangent space. 
Proof. S is clearly linear by the properties of vR"*" | Because (U,U) = 1 is 
constant, 
0=V(U,U) =(VE"'U—-U) + (U, VEU) = 208) 


and hence, (VEU, U) = 0. Thus, (S(V),U) = (- VEU U) = 0, so 
S(V)(p) € T,M for all p € M. Now we must show that the linear trans- 
formation S is symmetric. Throughout the following, we use the fact that 
(U, Z) = 0 for any tangent vector field Z as well as Proposition 3.1 (iv) and 


(v). 


ve 


=({- U,W) 

= (U, oe" W) by (iv) and (U, W) = 
= (U,[V, wavy V) by (v) 

es [V,W]) +(U, VE Vv) 


(98, Oo. V) since [V, W] is tangent and by (iv) 
= (V,-ViyU) 
= (V,S(W)). 
oa 
Example 3.1: The sphere S$”. In coordinates z',... ,2”+1, the unit nor- 


mal vector field for the n-sphere in R”*+! is 


Us = Care per Ve 


i 
The Euclidean covariant derivative is just the coordinatewise directional de- 
rivative, so for a tangent vector field V = (V*,... ,V”*1), 


S(V)(p) = -VEGU 
= -F (V(p)[a"],... ,V(p)[a"**]) 


-5 (Vip) ys) 


i 
R 
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where the third line follows since the x* are coordinate functions and we have 


Viz] =>— On" ys = yigi = yi 
—~ Oxi 5 
j 
Hence, the shape operator of the n-sphere is completely analogous to that of 
the 2-sphere — it is simply scalar multiplication by the negative reciprocal of 
the radius of the sphere. 


While the shape operator for hypersurfaces plays a role similar to that for 
surfaces in R°, it must be realized that it can never assume as important a 
position in general for higher dimensional manifolds. This is simply because 
hypersurfaces are rather rare among manifolds in higher dimensions whereas 
surfaces in R? are, by definition, hypersurfaces. For manifolds in R’+! which 
are not hypersurfaces, we may replace the shape operator, or more precisely 
(S(V), Z) U, by the more general second fundamental form. When we write, 
for tangent vector fields V and Z, 


VvZ = proj r,MVe Z 


we are ignoring the normal component of vr" Let us now put it back in by 
writing superscripts T and N to denote the tangential and normal projections 
respectively; 


We Z=(Vy Z)"+(Vy 2) 
VV Z+ BV, 2). 


B(V, Z) = (VB""" Z)N is called the second fundamental form of M® c R"+1, 


Proposition 3.4. B(-,-) satisfies the following properties. 

(1) B(fV, Z) = fB(V, Z) 

(2) B(V, fZ) = fB(V, Z) 

(3) B(V, Z) = B(Z,V) 
The first two properties are included in the statement that B is bilinear; that 
is, linear in each variable separately. The last property is what is meant by 
saying that B is symmetric. 


Proof. We shall prove (2) and leave (1) and (3) to the following exercise. By 
definition, 


BV, fZ) =VB" fZ—-VvfZ 
=V(f|Z+fVP"Z-Vif]Z—fVvZ 
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by property (iii) of Proposition 3.1. Hence, 


= fVe Z = FVus 
Rt! 


=f (vE Be VvZ) 
= fB(V, 2). 


O 


EXERCISE 3.5. Show that properties (1) and (3) hold above. Hints: for (1), use 
property (ii) of Proposition 3.1. For (3), use property (v) of Proposition 3.1 and the 
fact that [V, Z] is a tangent vector field, so [V, Z|‘ = 0. 


Now, by Proposition 3.4, at each point p € M, B(p) is a symmetric bilinear 
mapping 7, MxT,M — N,M, where NpM = (Ui,... ,Un41-%) is the normal 
space to M in R"+’. From B we can define the mean curvature vector field 
to be 


where 01,... , Ox are assumed to be orthonormal at p. 


EXERCISE 3.6. Show that this definition of H, in the case of a two dimensional 
patch x(u,v) in R%, is just twice our old notion of mean curvature. 


We will talk more about the second fundamental form after we discuss Christof- 
fel symbols in the next section. A good general reference is [DoC2]. It is 
important to note that what we have done in this section is not the most 
general definition of a covariant derivative. We have already said that man- 
ifolds may be defined without reference to an ambient Euclidean space, but 
it is also the case that covariant derivatives V (or connections as they are 
also called) may be defined without reference to an ambient Euclidean co- 
variant derivative VR"*’. The essential properties of an abstract covariant 
derivative are properties (i), (ii) and (iii) of Proposition 3.1, while proper- 
ties (iv) and (v) of that result endow an abstract covariant derivative with 
the special names Levi-Civita connection or Riemannian connection. Every 
manifold (with given metric) has a unique Riemannian connection and, for 
our manifolds M* c R”+! with the induced metric, the connection we have 
defined is plainly that Riemannian connection. 
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9.4 CHRISTOFFEL SYMBOLS 


In Chapter 3.4, we introduced Christoffel symbols as the coefficients of a 
basis expansion for the second partials x,,, and x,, in terms of x,, x, and 
U. Recall that we assumed that the basis had F = x, -xX, = 0 so that we 
obtained 


Xuu = Ty Xu t yy Xe + lU 
Xuo = DyyXu + TY, Xe + mU 
vy =TeyXu +l pyXe + nu. 


We were able to identify the Christoffel symbols in terms of the metric EF and 
G (with F = 0 remember); 


“= Fy ~ Foxy +1U 
Ey Gu 
er aBk + oa + mU 
Ke Gu ae Xy + nu. 
UU ~ 9oR* IG UV 


Now, Xyz is nothing more than VE ee and similarly for xy, and Xyy. 
Also, removing the U-components [U, mU and nU from the expressions above 
then produces Vx,Xu, Vx,Xv and Vx,Xy respectively. To generalize this to 
k dimensions, we simply write 


ve 8; =>, On + tae 


and, taking tangential components, 


k 
V0.0; = 91,0 
s=1 


EXERCISE 4.1. Use Lemma 3.2 to prove that Tj; = 1, for all i,j, s. 


Just as Christoffel symbols in the two dimensional case may be expressed 
in terms of the metric, so too can higher dimensional Christoffel symbols be 
expressed in terms of the metric (in traditional higher dimensional notation) 


Jig = (O;, 0;) with g= det(9;;) 


where (g;;) is the matrix of metric coefficients. Of course, we have gi; = 9ji 
so the matrix is symmetric as well as being invertible. We also have need of 
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the inverse of the matrix (g;;) which is denoted by (g7). This inverse will 


allow us to isolate certain quantities as exemplified by the proof below. In 
particular, note that we always have the relation 


9" G38 = OF 
j 


where 67” is the Kronecker delta. 


Proposition 4.1. The Christoffel symbols are determined by the metric as 


1 ; 
Y= 3 Xo (9:95; + O91; — Oj 9si) - 
Proof. Property (iv) of Proposition 3.1 gives 


039i; = 8; (0;, 9;) 
= (Vo, 0;, 9;) + (0;, Va, 9;) 


= (5° 1),81, 85) + (Oi, 915501) 
l I 
= y (Ts9 a jit) . 
l 


Similarly, 


igis = >_ (Vij Gis + Tin 9s1) and Ojg8 = >, (Ti ,gu + Vhigst) 
; ; 


with 
039i; + O:9j;5 — 959s: = S- TisGjl- 
l 
Now, multiplying by the m*® row of the inverse metric matrix, we obtain 
; S59"? (Bigs; + O59i3 — Oj 9si) = Dae ae > 9 9 =TE 
J l j 


since >, 9" 951 = 6[”. O 
Corollary 4.2. For M = R"*!, the Christoffel symbols are all zero. 


Proof. The metric coefficients g;; are all constants, so their partials are zero. 
By Proposition 4.1, the Christoffel symbols vanish. O 
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Example 4.1. Consider a two dimensional patch x(u,v) = x(u!,u?) with 
F =x,,:X, =0. The metric matrix and its inverse are then 


a=(4 2) @=(o" ve) 


with determinant g = det(g;;) = EG. Then, for example, 


11 1 E 

Oe eee <= = — 

ee a 5 Eu + Ey — Ey) + 50 re 
11 if E 

rm, =Tl, = ~-=(0+ By —- ~.0= —., 
ta 3 BOT # +5 : 2E 


EXERCISE 4.2. Show that [%, =I. = —Gu/2E, [¥, =1?, = —Ev/2G, T, = 
T?5 = Gu/2G and TY, = 13, = Gy/2G. 


Since the formulas for the I’s above occur in the geodesic equations in 
two dimensions, it should be no surprise that they occur as well in higher 
dimensions. More generally, a tangent vector field V is said to be parallel 
along a curve a: I > M*®¥ =x(u!,... ,u*) if VaV =0 at every point on the 
curve. This condition may be translated into local coordinates as follows. Let 
v= par vs 0;. 


vv = Hy 
du‘ dvi 
5 ag 2 5 + LY LT5Os 
a j 
du‘ 0 
= Ret oe “LLe re. 


8 1 Jj 


This expression is zero exactly when each component is zero. Hence, for each 
s we have 


du’ Ov — du’ rs) 
» aoe «Do ) oo 


4 
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Now, the first term )7; (du’/dt - Ov%Ou") is precisely dv’ /dt by the chain rule. 
Therefore, 


Proposition 4.3. V is parallel if and only if, for each s, 
rae ye Vy my = = 0. 


Corollary 4.4. A curve a is a geodesic on M* if and only if 


d?us , du’ duJ 
Ge De gs = ° 


Proof. We have a(t) = x(ul(t),... ,u*(t)), so that a’ = >, (du? /dt)d;. Thus, 
the geodesic condition Vg/a’ = 0 translates into the equation above by taking 
the formula of Proposition 4.3 and substituting du’ /dt = v7. O 


Parallel vector fields always exist and are unique because the condition 
of Proposition 4.3 is a linear first order differential equation. Here however, 
we cannot determine one specific angle function which describes the parallel 
vector rotation. The reason for this is apparent. It is only in the plane that 
rotations are determined by angles. In higher dimensions, it is the special 
orthogonal group SO(n), consisting of matrices A with det(A) = 1 and AA’ = 
I, which describes rotations. 


EXERCISE 4.3. Define parallel translation as follows. Given Vo at p € M anda 
curve a on M with a(0) = p. Let V be the unique parallel vector field along a with 
V(0) = Vo. Then V(t) is the parallel translate of Vo. Show that this association is 
a linear transformation from TpM to T,(4)M. Further, this linear transformation is 
an isometry. That is, for parallel vector fields V and W along a, the inner product 
(V, W) is constant along a. It may also be shown that parallel translation preserves 
orientation and it is known that orientation preserving isometries are in SO(n). 
Hint: to show an isometry, use property (iv) of Proposition 3.1. 


In Chapter 8 we showed that geodesics arise as extrema for the energy 
integral { T dt where the kinetic energy is given by T = 1/2|a’ |?. In higher 
dimensions a’ = )>,(du’/dt) 8; and, since we do not assume 0; - 0; = 0, we 
then have |a’|? = )>; ; 9iju't). The appropriate energy integral we wish to 
extremize is then 


[rewo,... ,u*(t), wi (t),... , w(t) d = f Daw tad dt. 
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Here we have dropped the superfluous 1/2. Generalizing Exercise 8.1.2 to k 
functions, for each s we obtain an Euler-Lagrange equation 


d@(OL\ OL | 
dt \ ous dus 
We can compute each piece directly as follows. 


OL 955 i OL 74 “4 
a and ae Dstt + dst 


aus + But” 


on (a 2965 4 ois me 
a (Sie) = (sot + D Bate’) + D (guste D Bara) 
09s 4 Gis gees 
=2 is J ois ind. 
Sous (Bes i) aa 
Hence, the Euler-Lagrange equation becomes 
as O9ei OGis 09:3 ek om 
dt (Sz) - i aus = Lagi aL (aet Gui Bus uw = 0. 


Now here is where we can use the identity }°,9”°gsi = 6%” to isolate the 
second derivative i’ = >; >>, 9'"*gsiti’ above. Sum the equation above by 
9" to obtain 


ai Dsl +5 ae aed kd =0 
ms g™ 095 09is 09:; ar] 
Yom Lowi + 5s (Gost + Soe — 88) ata =o, 


which then reduces to 
im + S° mata) =0. 
i) 
These are, of course, the geodesic equations. Therefore, in higher dimensions 


as well, the geodesic equations arise from a variational principle. 


EXERCISE 4.4 Show that, for an orthogonal two dimensional patch x(u,v), the 
geodesic equations above are those of Chapter 5. 
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We now wish to show that certain normal coordinates may be chosen for 
which the covariant derivative is quite well behaved. This choice of coordinates 
will simplify our discussion of curvature in the next section greatly. 


Theorem 4.5. For a manifold M* and p € M, a patch x(u!,... ,u*) may 
be chosen about p with x(0,... ,0) =p so that 


9ij(P) = (8i(p), 8;(P)) = 6] and =~ Va, 8;(p) = 0 


These coordinates u',... ,u* are called normal coordinates. 


Sketch of Proof. The details of a proof of this result require knowledge of the 
exponential map. Since we have not talked about this subject, we can only 
outline the ideas involved here. Take the tangent space at p and, for any 
vector v € T,M, map the line through the origin of T,M in the direction v 
to the unit speed geodesic ay through p in the v direction, 


tv + a,(t). 


For simplicity, assume M is complete, so that geodesics run forever. This 
definition then defines a k-patch from T,M to M. Choose an orthogonal 


coordinate system u!,... ,u* for T,M as coordinates for the patch x. By 
definition, g;; = 6 at p since the coordinates are orthogonal. Also, for a 
tangent vector v = (v!,... ,v*), we have, 


x(ui(t),... ,u*(t)) = ay(t) = x(tv',... ,tv®) 


so that ii? = d?(tv')/dt? = 0 for all i. The geodesic equations, which of course 
hold on a, become 


STG (av(t) ure? = 2TH (ay(t))u'v? = 0, 
1,J 


and this is true for all v € T,M. But this can only happen if all the D7} are 
zero. Hence, 


V0,0;(p) = LG 


O 


With the preliminaries above, we can end this section by giving a brief 
(and somewhat simplistic) sketch of the relationship between the higher di- 
mensional mean curvature vector field and higher dimensional “area” mini- 
mization. Note how the use of normal coordinates makes the argument much 
simpler than it otherwise might be. The reader is recommended to [Law, 
Theorem 1.4] and [GHL, Theorem 5.19] for rigorous details. 
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Suppose we have a manifold M (which we assume is rather local in the sense 
of being a single patch, say) and we vary it as in Theorem 4.3.2. Namely, we 
take M, = M + tU, where we assume U is a vector field which is always 
normal to M. For a set of coordinates (u!,... ,u®), the area element is given 
by 

dA = /g du’... du® 
where g = det(g;;) is the determinant of the metric coefficient matrix. (In two 
dimensions recall that dA = VEG ~— F* dudv.) Similarly, for g, = det (9:;;) 
on M;, we have area element 


dA; = /G du‘... du® 


_ VGA 
Vg 


_ v9 
= Gea 


and the higher dimensional area is then 


A(t) = ‘a aA 


As in Chapter 4, we want to find critical points for this area functional. Note 
that the integral above depends on the point about which we take our coor- 
dinate system, so let us fix p € M and let us take a normal coordinate system 
about p. Therefore, for these normal coordinates, at p, the tangent vector 
fields 0),... , 0, are orthonormal and all the covariant derivatives V9,0; van- 
ish. We can consider the normal vector field U as another local coordinate, 
so we have, for all 7 and j, 


Now we wish to calculate 


[0;,0;}=0 and [U, 8;] = 0. 
we) dA. 


dt Ma a, t=0 


Because the coordinate vector fields are orthonormal at p, we have g = 1 
there. Hence, 


4 (v@)| _ Laat 
dt G/ \t=0 2 VI Itxo 
1 de 
2 dt Te 
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EXERCISE 4.5. Let Q(t) = (q;;(¢)) be a matrix whose entries depend on ¢ and for 
which Q(0) = J, the identity matrix. Show that 


d dqi; (0) 
a det OO lik =F (“| 


Hints: write out the definition of the determinant of Q(t), 


detQ = So (-1)8"? a16(1) * + Iho (k) 
o€S, 


where o is a permutation in the permutation group on k letters S;, and use the 
product rule to find dQ/dt. Then evaluate at t = 0 to make all terms in the 
determinant zero except for those having each factor as grr(0) = 1 together with a 
single dq;;(0)/dt. Thus, the determinant sum reduces to the sum of dq;;(0)/dt’s and 
this is tr Q’(0). 


Since g = det(g;;), the exercise may be applied to the result above to 
produce 


d(vB)) _ 25 28 
dt G/ \1x0 2 OF |teo 
k 
i Ox =z 
= oe Hi Oi» 9) t-0 
i=1 


where 8; is the vector field at level t determined by the local diffeomorphism 
mt>+m+tU. Then, 


aa) 


since U is the t-direction. By property (iv) of Proposition 3.1, we then have 


§(%Z) |< FFB B+ BYE" Blas 


= VE 8: 81), 


k 
= me (VEU, 8;) |, 
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by (v) of Proposition 3.1 and the vanishing of the brackets mentioned above. 


Hence, again by (iv), 


k 
= 25K U, 8:) |, =0 (U, VE 8:)|,-9 
k 


U,70,:) ~ (U,(VB""8,) ) 


since VR"** = V + (VR )N and, at t = 0, we are on M. Also, for t = 0, 
U and the 0; are orthogonal and the coordinates are normal, so 


where H is the mean curvature vector field of M in R"*!. After integration, 
we have the 


Theorem 4.6. The derivative of the area functional at zero is given by 


where H is the mean curvature vector field and U is the normal variational 
direction. 


If M (ie. ¢ = 0) is a critical point for all variations, then we must have 
H =0. This follows because any nonzero part of H would produce a nonzero 
(U, H) for some U and this then could be localized by a bump function to 
produce a nonzero integral, contradicting the assumption that M is a critical 
point of A(t). Thus, we have the higher dimensional version of Theorem 4.3.2, 


Corollary 4.7. If M is least area, then the mean curvature vector field is 
zero. 
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9.5 CURVATURES 


In Chapter 3.4 we developed a formula for Gauss curvature which depended 
only on the metric. This led of course to Gauss’s Theorem Egregium. If we 
look closely at how the formula was derived, we see that we used the fact 
that Xyuv — Xuvy = 0. Written in our “covariant” notation (and noting that 
the covariant derivative with respect to a parameter curve is simply partial 
differentiation), we have 


Rt! R2+1 R@+1 R®+1 


eye xe ve VE ey SO. 


Furthermore, the derivation also reveals that we made essential use of the 
basis descriptions of U,, and U,, where U is the unit normal of the surface. In 
fact, it was these terms which provided the (In—m?)/EG = K in the formula. 
Now, however, we must use M*’s covariant derivative V without the benefit 
of a unit normal. Therefore, we should not expect the corresponding quantity 


Va; V0,9s — Va, Vo; 9s 


to be zero. Indeed, analogous to the derivation of the two dimensional for- 
mula (but without the ingredients U,, and U,), we may define a vector field 
R(O;,9;)0, = >, Ri je by 

R(0;, 0;)05 = Vo; Va,9s — Vo, Va; 9s- 


The object R(-,-) is called the Riemann curvature. We want R to possess 
certain qualities. Namely, we want R to be bilinear with respect to smooth 
functions. By this we mean that R(fX,Y) = fR(X,Y), R(X +Z,Y) = 
R(X,Y) + R(Z,Y), R(X, fY) = fR(X,Y) and R(X,Y + Z) = R(X,Y) + 
R(X, Z) for any smooth function f: M — R. But, as the reader is asked to 
check in the following exercise, as yet, this is not the case! Moreover, we want 
R(-,-) to be a linear map on vector fields as well. 


EXERCISE 5.1. Use the properties of covariant derivatives to show that 
(1) R(O;,0;)fOs = fR(O;, O;)Os. 
R(fO:, 0;)Os = 0; fVa,9s + fR(O;, 0;)Os 
(2) = —Vipa,,0;]s + fR(O;, 0;)Os. 


where the last line follows from Proposition 3.1 (ii) and Exercise 3.3 (4). Hence, R 
is not linear in the first (or second) variable. 
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The last formula of the exercise gives a hint as to how to redefine R so as 
to achieve bilinearity. Namely, for tangent vector fields X, Y and Z, we have 


Definition 5.1. The Riemann curvature is defined to be 
R(X, Y)Z= Vixy]Z + V¥VxZ—- VxVyZ. 


EXERCISE 5.2. Show that 
R(8;, 0;)Os = Do R08 


or; or’, mol mpl 
> (- ai + MeL im — D7eT im O. 


Since, as we showed previously, all Christoffel symbols vanish in R"+1, the expres- 
sion above says that the Riemann curvature of R"+! is zero sveryeuere. We say 
that R”*? is flat. Further, show that 


Rijer © = = Do Rear 


ee 
ar! 
a > (5-5 rane LTE im — rm Str 


and calculate R1212 for a two dimensional patch x(u!,u?) = x(u,v) with gi1 = E, 
912 = g21 = 0 and g22 = G. Show that 


R = EG EuGu = i Ev = EyGy | EyEv Gu GuGu 
1212 4E2G E \2G 4EG2 1E2G 7 G — Fee 
and compare this with the formula above Exercise 3.4.3 to see that 
K = Piz 
g 


where K is Gauss curvature and g = det(g;;) = EG. Thus the Riemann curvature 
generalizes the Gaussian curvature. 


EXERCISE 5.3. Suppose M” c Rt! is a hypersurface with unit normal U and 
shape operator S. Show that, for tangent vector fields X, Y and Z, 


R(X,Y)Z = (S(X), Z) S(Y) — (S(Y), Z) S(X). 


Hints: (1) write 


ae n+1 n+1 
R(X,Y) = (Vy) -Sixy)) + (VE — Sy (VE — Sx) 
He SVE Sy) 
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where Sy (Z) = (S(Y), Z). (2) Use the facts that 


n+1 n+1 n+1 n+1 n+1 
Vay tVe Ve -Ve VE =0 
(since R"*?! is flat) and (vEr**y, U) = 1/2Y(U,U) = 0 (since (U,U) = 1 is con- 
stant). 


It is clear that Riemann curvature is a complicated object to deal with. 
We can gain some intuition about it however by considering vector fields in 
pairs. First, let us introduce the notation 


R(X,Y, Z,W) © (R(X, Y)Z,W). 


Then we define the sectional curvature of the two dimensional subspace of 
T,M spanned by X(p) and Y(p) to be 


R(X, Y, X,Y) 
(X,X)(Y,Y) — (X,Y)? 


As the notation suggests, the sectional curvature is a Gauss curvature of a two 
dimensional submanifold of M through p with tangent plane {X,Y}. This 
generalizes Exercise 5.2 above where K = Rjo12/g = R(O;, Ao, 01, O2)/9. 
Although we will not prove it, it is a fact that the sectional curvature actually 
determines the Riemann curvature. Of course the reverse is true by definition. 
Also note that if X and Y are orthonormal, then K(X,Y) = R(X,Y,X,Y). 


K(X,Y) = 


Example 5.2: The sphere S”. In Example 3.1 we saw that the shape 
operator of S” is given by S(X) = —X/R, where R is the radius of the 
sphere. From the formula of Exercise 5.3, we obtain 


R(X,Y)Z = (S(X), Z) S(Y) — (S(Y), Z) S(X) 
1 
= pa ((X, Z)¥ — (¥, Z)X). 


The sectional curvature has an even more perspicacious appearance — 
especially when we take X and Y to be orthonormal. For then we have 


K(X,Y) = R(X,Y,X,Y) 


1 
= Re (Xs ANY — (Y, X)X,Y) 
1 
R? 
since (Y,X) = 0 = (X,Y) and (X,X) = 1 = (Y,Y). Thus K = 1/R? is 


the constant sectional curvature of S”, completely analogous to the constant 
Gauss curvature of S72. 
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EXERCISE 5.4. Show that the Riemann curvature obeys the following symmetry 
relations. 

(1) R(X, ¥, Z,W) = —R(Y,X, Z,W) 

(2) R(X, Y,Z,W) = —R(X,Y,W, Z) 

(3) R(X,Y,Z,W) = R(Z,W, X,Y). 

Hints: (1) (Vy VxV,V) = Y(VxV,V) — (VxV,VyV) and (2) (Vix yyV,V) 

= 1/2[X,Y](V,V), both by (iv) of Proposition 3.1. For part (3), use the first 
Bianchi identity below. 


EXERCISE 5.5. Prove the first Bianchi identity: 


R(X, Y, Z,W) + R(Y, Z,X,W) + R(Z,X,Y,W) =0. 


Hints: W is superfluous, so drop it from the notation. Write out the definitions 
of the terms and group them to make use of the identity VxY - VyX = [X,Y] 
etc. Then group the terms according to the respective Lie brackets which arise and 
apply the identity again to end up with only brackets in the formula. Apply the 
Jacobi identity (Exercise 3.3 (3)). 


There is a second Bianchi identity which will prove very convenient a bit 
later. This identity involves covariant derivatives of the Riemann curvature, 
so we must understand how these are defined. The covariant derivative of R 
is given by 


VR(X,Y,2Z,W,V) = Vv R(X, Y,Z,W) 
= V([R(X,Y,Z,W)] — R(Vv X,Y, Z,W) 
— R(X, VvY, Z,W) — R(X, Y, Vv Z, W) 
— R(X, Y,Z, VvW). 
Proposition 5.1: The second Bianchi identity. 


VR(X,Y, Z,W,V) + VR(X,Y,W,V,Z) + VR(X,Y,V,Z, W) =0. 


Proof. Choose normal coordinates (Theorem 4.5) about a point p € M and 
test the identity at this arbitrary point. Let X, Y, Z, W and V be in the 
orthonormal coordinate basis with VxY(p) = 0 etc. Also note that brackets 
of these vector fields vanish as well by [X,Y] = VxY -VyX =0-0=0. 
Then 


VR(X, Y, Z,W,V) = Vv R(X, Y,Z,W) 
= Vy R(Z,W,X,Y) 
= V[R(Z,W, X,Y)] since Vy(-) =0 


by Proposition 3.1 (iv) 


a (Vv R(Z, W)X, Y) and VvY = (0) 
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and similarly, VR(X, Y, W,V, Z) = (VzR(W,V)X,Y) and VR(X,Y,V, Z,W) 
= (VwR(V,Z)X,Y). Hence, X and Y are superfluous, so we remove them 
from the notation. Then 


VvR(Z,W) + VzR(W,V) + VwRV, Z) = (Vv Vw - VwVv)Vz 
+(VzVv —-VvVz)Vw 
+(VwVz- VzVw)Vv 


and, evaluating on X, we obtain (Vy R(Z,W)+VzR(W,V)+VwR(V, Z))(X) 
= R(W,V,VzX)+ RV, Z, VwX) + R(Z,W, Vy X) since brackets are zero. 
But then, since VzX = 0 etc., we have 


(VV R(Z,W) + VzR(W,V) + VwR(V, Z))(X) = 0. 


Hence, the result is proved. C 


While the Bianchi identities may seem rather esoteric, they do simplify various 
calculations considerably as we will see below. 

Higher dimensional differential geometry offers us both the challenge of 
understanding nonvisualizable geometric phenomena and the opportunity to 
create new tools with which to study such phenomena. Some of these new 
tools are new types of curvatures which, in two dimensions, become the Gauss 
curvature. We have already seen that sectional curvature can tell us infor- 
mation somewhat obscured by the Riemann curvature. Sectional curvature, 
however, cannot be a panacea since, as we mentioned previously, it determines 
Riemann curvature. Instead, there is a general method called contraction 
which is available to us in higher dimensions and which allows us to isolate 
more tractable portions of the Riemann curvature. In order to do this, we 
will generalize the notion of a frame field which was used in Chapter 6. If 
E1,... , Eg are vector fields defined on a neighborhood of a point p € M* with 


(E:,€)) = 6 


at every point in the neighborhood, then the collection {€,... ,E,} is called 
a frame about p. One way to obtain a frame is to choose normal coordinates 
around p (Theorem 4.5) and then parallel translate the corresponding vector 
fields along geodesics passing through p. By Exercise 4.3, parallel translation 
is an isometry, so the orthonormality of the frame is preserved as it is trans- 
lated to other points in the normal coordinate patch. The one bothersome 
thing about using a frame is that we lose many of the coordinate formulas 
we obtained earlier. Nevertheless, we shall see that the advantages of a frame 
outweigh the disadvantages, so assume that we have a frame about p in what 
follows. 
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Definition 5.3. The Ricci curvature is defined to be 


k 
Ric(X,Y) = 5 (R(X, &:)Y, E:) 


i=1 


where X and Y are tangent vector fields on M*. The scalar curvature is 
defined to be 


k 
k=) Ric(E;,&;) 
j=l 
k 
= Do (RUE; EE; Ei). 
igat 


The first thing we should note is that these definitions are independent of 
the frame field we choose. The reason is this. Any two frames are related at 
a point by an orthogonal matrix; that is, a matrix A with A’ = A7!. This is 
equivalent to saying that the rows of A form an orthonormal set of vectors. 
This, together with the symmetry relations of Exercise 5.4 suffice to prove the 
invariance of the definition. Rather than give a proof of this in general, we 
will simply concentrate on the k = 2 case. So, suppose that {F;} is another 
frame field which is related to {€;} by a 2 x 2 orthogonal matrix A = (as¢) as 
follows: . 
Ej = ayj,F1 + agiFo. 


Now we can just compute using the fact that the rows of A are orthonormal. 
That is, @11@21 + @12@22 = 0, a?, + a?, = 1 and a2, + a2, = 1. 


Ric(X,Y) = (R(X, E1)Y, 1) + (R(X, £2), €2) 
= (R(X, a11Fy + d21F2)Y, a11F1 + a21F 2) 
+ (R(X, a12F1 + d22F2)Y, a12F1 + a22F2) 
= a2, (R(X, Fi)Y, Fi) + ay1a21 (R(X, Fi )Y, Fe) 
+ @91011(R(X, Fo)Y, Fi) + 03, (R(X, Fe)Y, Fe) 
+ a?.(R(X, Fi)Y, Fx) + a12022(R(X, Fi)Y, Fe) 
+ a22012(R(X, F2)Y, Fi) + 039(R(X, F2)Y, Fa) 
= a?, (R(X, F,)Y, Fi) + 2a11021(R(X, Fi)Y, Fa) 
+ a3, (R(X, F2)Y, Fe) 
+ a2)(R(X, Fi)Y, Fi) + 2a12022(R(X, Fi)Y, Fe) 
+ a39(R(X, F2)Y, Fe) 
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using the symmetry relations of Exercise 5.4. Then, 


Ric(X,Y) = (aj, + afg)(R(X, Fi)Y, Fi) 
+ 2(@11021 + 412022)(R(X, Fiyy, F2) 
ae (a3, zs a3o)(R(X, FayyY, F2) 


= (R(X, Fi)Y, Fi) oF (R(X, Fa), F2) 


by the equalities a1;a21 +1202 = 0, a?, + a?, = 1 and a3, + a2, = 1. Hence, 
the Ricci curvature is well defined. 


The second thing to note is that both Ricci and scalar curvatures come 
about through a “trace-like” process. Of course, to take the trace of a matrix 
A = (a;;), we form 


trA= So aii = S- (A(E;), €:) 


for an orthonormal basis {€,...,&,}. Here, for Ric, we take two slots of 
the Riemann curvature and sum over the same elements of the frame. Simi- 
larly, and even more closely analogous to trace, to form «, we sum over the 
only available slots in Ric. This process of summing over two slots with the 
same frame elements is called contraction and is sometimes denoted C. Thus, 
Co4R = Ric and Ci2 Ric = « since, in the first case, we sum over the second 
and fourth slots and, in the second case, we sum over the first and second 
slots. We note here that some authors order the subscripts on R;;1, differ- 
ently, so contraction subscripts may have to be adjusted to correspond to 
ours. As the reader might expect, there is a more formal and precise notion 
of “contraction,” but we shall have no need of it here. 


Example 5.4. (i) For a k-manifold M*, let us compute the (only available) 
contraction of the metric (-,-). For a frame as above, 


since (€;, €:) = 6. 
(ii) Contraction can also be done by coordinates. Without going into de- 


tails, for coordinates u/,...,u* and basis tangent vector fields 0),... , Ox 
with (R(O;, 0;)Os, 0,) = Riggers let 


Ris = S- Rijsrg?”- 
jr 
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These are then the components of Ricci curvature. Thus, by Exercise 5.2, 
-ry(B-F 
4 (58 Dut + LTT im itn da sirg” 

2 y (SF om Bote Lu i rit 


+ LKB = Th) ao 


since )>, girgi” = 6. 


In order to see the power of these curvatures, we offer the following theorems 
without proof. For a proof of the the first, see [GHL] for example. 


Theorem 5.2: Myers’ Theorem. Let M* be a complete k-manifold and 
suppose that the Ricci curvature on all of M* is strictly bounded away from 
zero; that is, 

k— 


1 
Rie(X,X) > —Z- > 0. 


Then M® is compact and the diameter of M® is less than or equal to tr. 


Compare this result with Bonnet’s result, Theorem 6.7.6. Scalar curvature 
also has the power to constrain the type of a manifold. Compare the following 
result (due to Ros [Ros]) to Alexandrov’s result, Theorem 4.4.3. 


Theorem 5.3. A compact hypersurface M” C R"+! of constant scalar cur- 


vature is a sphere S” with metric induced by that of R"+?. 


EXERCISE 5.6. For a surface defined by an orthogonal patch in R°, show that Ricci 
and scalar curvatures are given by 


Ric(X, Y) = K (X,Y) and K= 2K 


where K is Gaussian curvature. Hint: use Exercise 5.2. The orthogonal patch 
condition is, in fact, unnecessary, but Exercise 5.2 assumed it for simplicity. 


EXERCISE 5.7. Show that the following formula holds relating the covariant deriv- 
ative, the metric and the bracket. 


2X, VzY) = Z(X,Y) +Y(X,Z) —-X(Y,Z) 
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EXERCISE 5.8. Here is a first small step toward getting away from the induced 
metric of R”+!. A metric (-,-) is a multiple of another metric (-,-) if 


(X,Y) =X(X,Y) 


for all X and Y, where A is a constant. If \ is allowed to be a smooth function on M, 
then the metrics are conformal. Denoting the covariant derivative and curvatures of 


the associated metric (., by Vv, R, Ric, k, show that the following properties hold 
for a “multiple” metric. 


(1 


VxY =VxyY. (Use the formula from Exercise 5.7.) 
(2) R 


(X,Y, Z,W) = R(X, ¥, Z,W). (Use (1).) 


Ric(X, Y) = Ric(X, Y). (Use (2).) 


) 
) 
(3) K(X,Y) = 5 K(X, Y). (Use (2).) 
(4) 
) 


(5) k= xh (Use (4).) 


EXERCISE 5.9. Let P* = {(u!,...,u*) € R*|u* > 0} be the “upper half space” 
analogous to the two dimensional Poincaré plane. Define an analogous metric on 
P¥ by 


(v,w) = ee 


In our present language, this means that 


psiecen a Se. 3 
nC aC 


where the e; are the standard orthonormal R* basis vectors. Hence, the metric is 
diagonal. The sectional curvature of P is known to be constant at —1. The follow- 
ing steps lead to a partial understanding of this result. For further and complete 
details, including formulas for general conformal metrics, see [DoC2, Chapter 8.3]. 


(1) Show that the Christoffel symbols are 
1 I 
rij = 3 9 ” (8:93: + 85911 — A953) 


=i" (Oi95r + O59ri — Or gj) 


nNolRe 


and that the only nonzero ones are 


j 1 . k 1 
Myer frit#k Tae 
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(2) Now show that the components of Riemann curvature relevant to sectional 
curvature are 


ori, 4s 
™m 


ari, ar, Pane 1 
= Jt Mj _ pmp) mains 
= (Fe oy + OTHE Sm ts bea (<n) 


and the only nonzero components are Ryjx; = Rikik = Rizig = ~1/(uk)4. 
Here we assume k # j in Ryjpj,t Fk in Rigip andiA~j,ixAk andj #k 
in Ri jij. 


(3) Using the calculation of (2), show that the components of sectional cur- 
vature Kj; = Rijij/(gii 933) for the cases above are constantly equal to 
—1. This exercise indicates how the conformal metrics of Chapter 5 may be 
generalized to higher dimensions. 


EXERCISE 5.10. A manifold M* is said to have nonnegative sectional curvature 
if K(X,Y) > 0 for all X and Y. The Ricci curvature is said to be nonnegative 
when Ric(X,X) > 0 for all X. Show that nonnegative sectional curvature on M 
implies nonnegative Ricci curvature on M and, in turn, this implies nonnegative 
scalar curvature on M. Hint: First prove the formula 


k- 
Ric(X, X) = (X, X) 3 K(X, 5) 
g=1 


for a frame {€;} spanning X + the space of vector fields orthogonal to X. 


EXERCISE 5.11. Here is an exercise which at once goes beyond our definitions 
and tests them. A Lie Group G is a smooth manifold with a smooth associative 
multiplication G x G — G having an identity e and inverses. Take G to be compact 
and connected. For example, the special orthogonal group SO(n) is a compact 
connected Lie group. We may define certain vector fields on G by fixing a vector 
X(e) € TeG and then taking X(g) = gx(X(e)) € TgG. Here, gs is the map induced 
on tangent vectors by the multiplication (g,h) 1 g-h for all hh € G. These types 
of vector fields are called left invariant vector fields. Lie groups are quite symmetric 
in the sense that the covariant derivative (which may be defined without reference 
to an ambient Euclidean space, but which has all the properties of Proposition 3.1) 
and curvature at the identity determine the covariant derivatives and curvatures 
at all other points of G. Furthermore, it is known that Vx X = 0 for X any left 
invariant vector field. Thus, curves whose tangent vectors belong to left invariant 
vector fields are geodesics. Also, for G as above, the following identity is known to 
hold: ([X, Y],Z) = —(Y,[X,Z]) for left invariant vector fields X, Y and Z. Here, 
(:,-) is a metric which is first defined on TeG and then transported around G by the 
Lie group multiplication. 
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Now, for a compact, connected Lie group G with left invariant vector fields X, 
Y and Z, show that 


(1) VxY = 5 [X,Y]. (Consider Vx_y(X — Y) and use Proposition 3.1 (v).) 


(2) R(X,Y)Z = 5 (XY), Z] 
(Start with the definition of R, then use (1) to express R completely in 
terms of brackets. Finally, use the Jacobi identity for the bracket (Exercise 
3.3 (3)).) 


(3) If X and Y are orthonormal, then K(X,Y) = j LX, y]|’. 


Thus, sectional curvature, and hence, Ricci and scalar curvatures are non- 
negative for a compact, connected Lie group. (Use the definition of K, (1), 
Proposition 3.1 (iv) and the identity ([X, Y], Z) = —(Y, [X, Z]) with Z = Y.) 


The Killing form of a Lie group G is defined to be 6(X,Y) = tr(adx ady), where 
adz(W) = [Z,W] is a linear transformation of the vector space of left invariant 
vector fields. It can be shown that b is symmetric, bilinear and invariant under all 
automorphisms of G. If 6 is also nondegenerate, in the sense that b(V,W) = 0 for 
all W implies V = 0, then 6 is actually the (bi-invariant) metric (-,-) on G. That 
is, (X,Y) = 6(X,Y). A Lie group with nondegenerate Killing form is said to be 
semisimple. For instance, SO(n) is semisimple. Using (2) above and the definition 
of adz, show that a semisimple Lie group with metric 6 has Ricci curvature 


Ric(X, Y) = -; b(X,Y). 


To end this section, we wish to make several calculations using the defi- 
nitions above to try to understand a tiny bit of modern physics. Einstein’s 
general theory of relativity is based on a principle that energy and momentum 
distort the geometry of spacetime and, as a consequence, are responsible for 
various physical phenomena such as light “bending” around the Sun (along a 
spacetime geodesic) and the precession of the perihelion of Mercury. Although 
this principle is simple to state in our naive discussion, the precise form of the 
relationship is far from clear. If we are to believe in the relationship at all, 
then perhaps the simplest such formula is 


G=cT 


where G is some type of curvature (called the Einstein curvature) and T is 
a quantity (called the stress-energy tensor) which depends on the amount 
of energy-momentum in a particular region of spacetime. In particular, in a 
vacuum we have T’ = 0 — but interesting physics (such as the bending of light) 
can still occur. If G were Riemann curvature R, then the formula G = cT 
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together with the assumption JT = 0 would imply the vanishing of Riemann 
curvature. But, as we have said earlier, R = 0 implies that a manifold is flat. 
Hence, spacetime would be forced to be flat and this would contradict the 
interesting geometrically induced physics observed. So, G cannot then be the 
Riemann curvature. The next logical choice to try is the Ricci curvature, so 
we take G = Ric provisionally. But we must ask, what else do we desire in 
a physical quantity? Certainly, the law of conservation of energy-momentum 
must be obeyed, so we obtain the vanishing of some sort of divergence of T. 
This means that we should also require G to be divergence free as well. Let 
us consider this now. 


If V = (V!, V?, V3) is a vector field in R® with coordinates x1, 2? and 2°, 
then we know 


av! av? av3 ee 
div V = aut a Dae. T+ 3S = 2 Ve, V, 4) 


where the e; are the usual Euclidean orthonormal basis vectors. 


EXERCISE 5.12. Verify the last equality 


av! av? av? R? 
pat + Ba2 ot a0 > > (Ve, V, e;). 
a 


In the same way, on a manifold M*, we can define the divergence of vector 
fields and bilinear quantities with two slots (i.e. 2-tensors) such as the metric 
(-,-) and Ric. Let A(-,-) be such a quantity and define the divergence of A, 
div A(-), to be a linear quantity with one slot (i.e. a 1-form; see section 6), 


div A(X) = S* Ve, A(Ei, X) 


=) EAE, X) — A(Ve,Ei, X) - A(Ei, Ve,X) 


where {€;}*_, is a frame and we use a definition for VA similar to that for 
VR in the second Bianchi identity Proposition 5.1. There are several things 
to note here. First, the divergence produces a “tensor” which is completely 
determined at each point by the values of its constituents at that point. Also, 
although we have used a frame to define the divergence, this is in fact only a 
convenient way to compute — the divergence may be defined without reference 
to a particular frame used for computation. With this in mind, consider 
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Example 5.5: The metric (-,-:) has zero divergence. To see this, take 
normal coordinates about an arbitrary point p € M and extend to a frame 
{€,,...,Ex} by parallel translation along geodesics emanating from p. In 
particular, we have Ve,€; = 0 at p. Now, because the divergence is determined 
by the values of the frame etc. at p, the same calculation as below may be 
made for a normal coordinate frame about every point in M. At p we have 


div (. ae Ve, (Ej, Ei) 


“36 (Ej, Ei) — (Ve, Ej, Ei) — (Ej, Ve, Ei) 


= EEE = 
j 


since we have normal coordinates at p. But the first term vanishes as well 
because (€;,€;) = 6) on the coordinate neighborhood and the €;-derivative of 
a constant is zero. Therefore, 


div (-,-)(E;) = 0. 


Since this is true for all €; and the process is linear, then it holds for all vector 
fields as well. 


EXERCISE 5.13. For a smooth function f: M — R, show that 


div(f(-,-)) = df 


where df is the differential of f defined by df(X) = X[f] for a vector field X. 
Thus, the divergence of conformal metrics (i.e. f > 0 at every p) depends on the 
conformality factor alone. Note that this makes sense in light of Example 5.4 since 
the divergence depends on the frame chosen. Verify that, for metric f(-,-), the frame 


{(1/./Ff) Ei} gives div(f(-,-)) = 0. 


We will now apply the divergence to Ricci curvature to find a remarkable 
relationship. 


Theorem 5.4. For Ricci curvature Ric and scalar curvature k, 


dx = 2 div(Ric). 
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Proof. Again choose a frame derived from parallel translation along geodesics 
of normal coordinates about an arbitrary point p € M. Hence, at p, Ve,E; = 0 
for all i and 7. Then 


2 div(Ric)(E,) = 2 py Ve, Ric(E;, Es) 
=2)_ &; Ric(E;,€,) 
J 


=2 Ss E; (R(E;, CiCa; E;) 


1,J 


= SS (R(E;, E)Es, Ex) + >> Ed R(E:s Es) Es E5) 


i,j 


where the second term is obtained by switching 7 and 7. Then, the symmetry 
relations for R imply 


2 div(Ric)( = Ee (Ej, Ex)Es, Ei) + EUR (Ej, E:)E5, Es) 
i,j 
= >> VR(E;, Ei, Es, Eis Ej) + VRE; Eis Ej, En, E) 
4,9 
=-))VR(E;, Ei, Ei, Ej, Es) 
4,J 


by the second Bianchi identity. Then, interchanging the first two coordinates, 


2 div(Ric)(E = D2 VRE: G5, E;, Ej, Es) 


i,9 
=) ERLE E38) 
1,J 
=€, > RE, Ej, Ex, Es) 
4,5 
= €,[k] 
= dk(Es). 


Since this is true on a basis, it is true in general. CJ 
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EXERCISE 5.14. The manifold (M*, (-,-)) with metric (-,-) is an Einstein manifold 
if Ric(X, Y) = A (X,Y) for a fixed constant and all vector fields X and Y. 


(1) Show that, if Ric(X,Y)(p) = A(p) (X, Y)(p) for a smooth function 
\: M* — R with k > 3, then M®* is Einstein. Hints: (1) first compute « 
and (2) then use Theorem 5.4. 

(2) Show that a semisimple Lie group with Killing form metric b is an Einstein 
manifold. Hint: use the last part of Exercise 5.11. 


Now, what has been the point of all this? The argument we presented 
earlier suggested that the Einstein and Ricci curvatures should be identified 
and that the Einstein curvature should be divergence free. But, as Theorem 
5.4 shows, in general, the Ricci curvature has a non-zero divergence. So what 
is to be done to rescue the idea? By Exercise 5.13, we know 


div(K(-,-)) = dk 
so that div(«(-,-)) = 2div(Ric) and, therefore, 
oe 
div (Ric —5 K(-, ») = 0. 
This calculation then says that, if we define the Einstein curvature to be 


G= Ric —$ K(-,-), then we obtain a type of curvature which is divergence 
free. Also, we have 


Theorem 5.5. The Einstein and Ricci curvatures determine each other. In 
particular, G = 0 if and only if Ric = 0. 


Proof. Denoting contraction by C, we may write 
ee 
G = Ric ——= k(.,-) 
2 
1 
= Ric 5 C(Ric) (-, -) 
since the contraction of the Ricci curvature is scalar curvature by the dis- 


cussion preceding Example 5.4. Of course, Example 5.4 itself shows that 
C((:,-)) = dim M, so we also have 


C(G) = C(Ric) - 5 wC((-,-)) 


1 
= — 5 «dim M 


= a-=" ‘ 
= ; : 
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Now, spacetime has four dimensions — time and R? — so dim M = 4. Hence, 
C(G) = —« and 


Rie = G+ in(-,-) 


2 
1 
=G-5C(G)(-,). 
Thus, Ric and G determine each other. O 


Therefore, it seems that the Einstein curvature is a good choice for the 
Einstein field equation G = 8xT (where the constant 87 is determined by 
taking the Newtonian limit). Indeed, in a vacuum, where T = 0, it is generally 
more convenient to solve the equations Ric = 0 rather than G = 0, so the 
relationship with Ricci curvature is important. Perhaps we should mention 
what the word “solve” means here. What is known and what is unknown? 
Generally, we try to determine the geometric structure of spacetime from 
a hypothesis on the Ricci curvature and a knowledge of some facet of the 
relevant geometry. In Example 5.4 we showed that the Ricci curvature is 
given in terms of Christoffel symbols and their derivatives. These, in turn, 
are given in terms of derivatives of the metric. Hence, the expressions 


Ris = 0 


form a system of second order partial differential equations in the metric. 
Therefore, when we solve this system, we are determining the metric in that 
region of spacetime. Since the metric determines everything, it can be safely 
said then that we understand the geometry of spacetime in that particular 
region. 


Example 5.6: The Schwarzschild Solution. The solution (i.e. the metric 
coefficients) for the vacuum field equations R;, = 0 outside a spherically 
symmetric body of mass M is given by 


2M “— . 
es alas m=- (1-2) g22=-1T* 33 = —r* sin” p 


where time ¢ is usually given the index 0. This is the Schwarzschild solution 
discovered by Schwarzschild in 1916. Note that this metric is different from 
our usual ones in the sense that it is not positive definite. Spacetime is, in 
fact, based on the flat (Minkowski) metric of special relativity which has the 
form goo = 1, gi11 = —1, g22 = —1 and g33 = —1. These metrics are said to 
be semi-riemannian metrics and much of the theory of ordinary Riemannian 
geometry carries over to these metrics as well. 


There are many good texts which discuss the material of this section in 
great detail. In particular, for general discussions of higher dimensional ge- 
ometry, see [Spi], [GHL] and [DoC2]. For a particularly elementary discussion 
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of the relativistic bending of light and the precession of Mercury’s perihe- 
lion, see [Fab]. For general relativity itself, see the classic [MTW] and for a 
beautiful combination of differential geometry and relativity, see [O’N2]. 


9.6 THE CHARMING DOUBLENESS 


But the beauty here lay in the duality, in the charming doubleness ... 
— Thomas Mann (Felix Krull, p. 77) 


Everything that we have done above has a dual formulation which is often 
powerful and quite beautiful. First, recall that any (real, say) vector space 
V has a dual vector space V* = {f: V > R|f is linear} consisting of linear 
functionals on V. In terms of a basis for V, {vi,...,Vx}, the dual vector 
space has basis {v1,... ,¥*} with the defining property 


vi(vj) = 6. 


In the same way, if the v; are tangent vector fields which form a basis at each 
point p € M, then their duals are called 1-forms and are denoted by 6°. Of 
course, the dual 1-forms have the defining property 6*(0;) = 65 as well. If we 
take the vector field basis to be {0;}, then a dual 1-form 6’ is usually denoted 
by du’ to indicate that coordinate vector fields 8; = 0/Ou' are in use. 

Now, a linear transformation f: V @V — V with f(v; @vs) = do, tv, 
may be described by writing f: V > V* @V with f(vi)(vs) = f(vi @ Vs). 
Perhaps we should remind the reader here that the symbol ® stands for ten- 
sor product. The tensor product of two vector spaces V and W with bases 
{vi,...,Vm} and {wi,...,Wn} is formed by taking the vector space V @ W 
with basis 


{vi @ w;} 
where i=1,...,m andj =1,...,n. Of course we may also write 
f(vi) = So a5,v @vr 
ir 


where the a”, € R and, since there is a dual basis element in the formula, we 
may evaluate at v, to get 


f(vi)(vs) = S- atv (Vs) Vr vi 
jiv 
= Ss; agiVr- 
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But, f(vi)(vs) = f(vi ® vs) = 2, 1°%,v, and, comparing the two formulas, 
we see that 

af, = Ti, 
and this relation holds for all i and s. Hence, f(vi) = > ie 7,0) @ vy. 

We can now do the same thing for the covariant derivative. More precisely, 
if V denotes tangent vector fields on M, then the covariant derivative is an 
R-linear map V: V@®V — VY. Just as above, this map may be rewritten in a 
dual form as V: V > Y* @ V with 


V(Ei) = > wf, 


where {€),... ,€,} is a frame on M* and the w? are 1-forms. Note that 
we omit writing the tensor product symbol ® for convenience. Also, now 
that we see that this is simply a different description of the same covariant 
derivative, we can dispense with V and simply write V. Let us identify the 1- 
form coefficients in terms of something we know. Let {6;,... ,4,} denote the 
dual 1-forms to the frame {&,... ,&,} and consider the following calculation 
analogous to the vector space calculation above. (We use symbols T° to denote 
coefficients in a basis decomposition even though the chosen frame may not be 
a coordinate frame. Therefore, the I'’s are not the usual Christoffel symbols 
in general, but, in case €; = 0,, then T = Pr.) 


Dae = Ve, Ei 
= V(Ei)(Es) 


on ee 


by the vector space formula above. s the 1-form formula, however, we have 


V(E (Es) = DoF (Ex) & 


sO 


>>| Ee (E) | & => of (E) E 
r j Tr 
Since this holds for all s, 
eh 6 = wr 


Therefore, the connection 7a w; may be written in terms of the dual 
frame and the coefficients of the covariant derivative’s basis expansion. 
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EXERCISE 6.1. Show that 
ahs — Tp & = (Ei, 5] 
so that there is no reason, in general, that Mi = r,. Hint: Proposition 3.1 (v). 


Also, show that Mi = I). Hint: Proposition 3.1 (iv). 


EXERCISE 6.2. Show that 


Hint: Proposition 3.1 (iv). 


Riemann curvature is defined in terms of “second covariant derivatives,” 
so if we want to have a dual version of this as well, then we must know how 
to differentiate 1-forms. For a moment, consider a function f of k-variables 


u',...,u® and compute its differential to be 


df = 3 ef du’. 


The du’ are 1-forms dual to the vector fields 8;, so df is itself a 1-form. This 
calculation from calculus tells us that the derivative of a 1-form should be a 
2-form. Now, just as a 1-form acts on vector fields, a 2-form should act on 
pairs of vector fields. 


Example 6.1. Let @ and ¢ be 1-forms. Define the 2-form 6 A ¢ by how it 
acts on a pair of vector fields V and W: 


OA GV, W) = OV) - d(W) - O(W) - o(V). 


This “product” 6/¢ is called the wedge product of 1-forms. Note that 1-forms 
have the property that 0\¢ = —@A9. In particular, this means that 9A@ = 0 
for all 1-forms @. 


To conform with the definition of df above, we define the exterior derivative 
of a 1-form f du® to be 


d(f du*)=S° at du? \ du’ 
j 


where the u’ are coordinates. This is equivalent to defining d by the formula 


d6(V,W) = V[a(W)] — W[A(V)] — a([V, W]) 


for any two vector fields V and W. 
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EXERCISE 6.3. For coordinate vector fields V = 0; and W = O,, show that the 
formula d0(V, W) = V[@(W)] — W[@(V)] — 8([V, W]) gives 


d(f du’) = 0; f6F — 8,6! 


where we may assume 7 < r. Explain why this is the same as the coordinate formula 
for d above. 


Theorem 6.1: First Structure Equation. Let {€;} be a frame on M* 
with dual frame {0°} and connection coefficients {w’}. Then, 


do’ = -So uj A@. 
l 
Proof. We will compute both sides of the equation on vector fields in the 
frame. 
d6"(E;,E,) = —8'({E, Er]) 
since 6*(E;) = 6; has zero derivative. Then, by Proposition 3.1 (v), 
d6"(E;,E,) = —0° (Ve, Er — Ve,€;) 
= —6' (VE,(E;) — VE;(Er)) 
= —69 (ste) E:— >| wi (E,) ‘i 
l l 
= wi (Ep) — wi (E;) 
since 0°(€,) = 6}. Now we compute the sum of wedge products. 
Dt NO (Ej Er) = D | (wi(Es) 9! (Er) — wi (Er) (E5)) 
l l 
= wi (Ej) — wf (Er). 


Because these calculations are the same on a basis, the general formula follows. 
O 
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Now let’s relate these ideas to Riemann curvature. We write, as usual, 
R(Ei, Ej Em = Do, Rojm Er and we compute 


R(E;, Com = ViEi€;)Em + Ve; Ve,Em — Ve, Ve; ES 


= VEm (Ei, €5]) + V(VEm (Ei) ) (Es) — V(VEm (Es) (Ei) 


= Dm (Es Eder + 2 w, (E:)Es)(E;) 
2 0 wn (Ej) Es) (Ex) 
= dem ((E EE. + Do Eien EE. E Dvn (Ei) ps wi (Es)Ep 
SS Do EwinlENE bs en(és) Tonlevé, 


=— 3 [Exwr, (E;) = E, wi, (E;) = wm ([E:, E;})) Es 


+> S Wn (Ei )ws (Ej) — win (Ej wg @) E, 
=-)> (sn) =n nerte.€) Er 
=-S (der, - > wh, Aw’) (Ei, Es) Er. 


If we let QF, = dwf, — 3°, ws, Awy, then we obtain 


Rim = —Qm (Eis 5). 
The matrix of 2-forms Q = (QF) is known as the curvature 2-form. Now, we 
could have defined (2 by the formula Rf,,,, = —Q1,(E:,€;) and then we would 
have 


Theorem 6.2: The Second Structure Equation. If 2 denotes the cur- 
vature 2-form and wy, are the connection 1-forms determined by V, then 


Te fee. r 5 8 Tr 
Qn me du, ~ Win A W,- 
8 
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Whether we define 2 in terms of R or derive the relationship, we see that 
we can determine curvature by combining the exterior derivative and wedge 
product of the connection 1-forms. 


Example 6.2. Suppose M: x(u,v) C R° is a surface patch with x,-x, = E, 
Xy,‘Xy = 0 and x,-x, =G. Define a frame by 


Xy 

a. 

Let 61 and 6? be the dual frame having 6*(£;) = 6!. Hence, in terms of the 
coordinate 1-forms, 6! = /E dx, and 6? = /Gdx,. Then, 


and E> = 


E E 
~—= dx, \ dx, + — dx, A dx,, 
WE WE” 


= ecseeh dx, A 6? 


2V EG 
since dx, \ dx, =0 and dx, = 6?7//G. Similarly, 
Gy 


do = 


do? = — dx, A 6}. 
2/EG ” 
Now, if we write wi = adx, + bdx,y, then w? = ~adx, — bdx, and d6! = 
adx, A 6? and d0? = —bdx, A 6). This means that 
ayes and b= Gu 
2/EG 2/EG 
Hence, w3 = ats dXy, + ate dx, and 


1 E G: 
ee _ Vv uUu 
dw3 = 5 (( Tak dx, A dx, + ice), dx, A dx, 


= 5a ((yea),* (Vee),) 9 
2VEG EG/, EG/, 
since 61//E = dx, and 6?//G = dx,. Therefore, 


dui = —K 6) p62, 


where K is Gauss curvature, by Theorem 3.4.1. Of course, we have met the 
1-form w4 before in Chapter 6, but there we confined it to be along a curve 
so that it could be thought of as a function. Here its true nature as a 1-form 
is revealed. The formula above, which is surely one of the most beautiful in 
all of Mathematics, also explains results such as Exercise 6.3.7. (Why?) 
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Example 6.3: The sphere 9%. We have E = R? cos? v, F = 0 and G = R? 
for the usual patch on $?. A dual frame is then given by 


6! = Reosvdx, and 6? = Rdx,. 


The exterior derivatives are 


do: = sinu dx, A 6” and do? = 0. 
Hence, wi = sinudx, and we have 
dw = cosv dx, A dxy 
cos U 


R? cos v 


1 2 
=a e Ne % 


Therefore, comparing this result with the formula dwi = —K 61/6?, we obtain 
K =1/R? just as we should. 


EXERCISE 6.4. Compute K for other surfaces using the formula dw} = 
—K6' Ae. 


In modern geometry and physics, differential forms play an important role, 
both conceptually and in terms of calculation. For a classical approach to 
forms, see [Fla]. The reader will also find many applications to physics there. 
A newer treatment is given in [Dar], where forms are applied to understand 
modern gauge theories. For an approach to surface geometry via forms, see 
[O’N]]. 
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ANSWERS AND HINTS 
TO SELECTED EXERCISES 


CHAPTER 1: THE GEOMETRY OF CURVES 


Let p = (—1,0,5) and q = (3,—1,—2) and substitute in the equation a(t) = 

p+t(q-p). 

a(t) = (-1+t,6 — 5t,5 — 8t, 9t). 

The vector (v1, v2,0) is the hypotenuse of the right triangle whose sides are the 

vector (v1,0,0) and a translation of the vector (0,v2,0). The vector (v1, v2, v3) 

is the hypotenuse of the right triangle whose sides are the vector (v1, v2,0) and 

a translation of the vector (0,0, v3). 

135° intersection. 

Consider a(0), a(%), a(m), and a( 3). What is the distance of each of these 

points from the origin? Also note that a(t) - a’(t) = 0 and that a(t) = —a(t). 

Start with the basic parametrization a(t) = (rcost,rsint) and recognize that 

t = 0 should correspond to the point (r+ a,b) in the zy-plane. Similarly, t = 5 

should correspond to the point (a,r +b). 

a(t) = (acos(t), bsin(t)). 

Use the chain rule on G(s) = a(h(s)). 

Show that |a’(t)| =r; s(t) = rt; and t(s) = %. Use the definition G(s) = a(t(s)) 
8 


to obtain G(s) = (rcos £,rsin =). 


Show that |a’(t)| = Va? sin? t + b? cos? t. Is s(t) = i, la’(u)|du integrable in 
closed form? 

Using the fact that T’-e, = |T||e;|cos@ and taking derivatives of both sides, 
first show that KN -e, = —sin 0%. There are two normals to @ at any point 
on the curve, so we have two cases: (1) N = Nj, where the angle between Nj 
and e; is 9+ 4; and (2) N = No, where the angle between N2 and e is 5 — 0. 
Use KN - e, = —sin 48 and the definition of the dot product to show that, for 


8 


N=Nj,%= 98, and for N = Nz, n= —%. 
Recall that cofactor expansion gives 


a j k 
Vv v3 |. v v : v v 
ey 09, 03) = 2 i aie Bd 3 es 1 2\p 

W2 W3 W1 W3 W1 we 
Wy W2 W3 


Recall that interchanging two rows of a determinant changes the sign of the 
determinant. What does this imply about w x v and v x w? 
Try Maple. 
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3.5 Rewrite Lagrange’s Identity as 
lv x wl? = Jul? |w|? — Jul? Iw]? cos” 6 


and simplify the right-hand side to obtain the desired result. 

3.7 The area of a parallelogram is given by bh, where b is the length of the base and 
h is the altitude. In the parallelogram spanned by v and w, h is equal to |v| sin 6, 
where @ is the angle between v and w. 


8 (1) Show that 6’(s) = (GE, ==, #). 


(2) Note that T’ = 6” = (oe: ew iert 0). Also, since T’ = KN, |T’| = 
[n||N| = k. 
(3) Since T’ = KN, N=E, 
(4) B’ = -TN, so |B’| =| — TIN], or |B’| =|-—7| =r. 
3.16 Use the fact that p = G(s) + r(s)@’(s) for some function r(s). Differentiate both 
sides to obtain 


(1+7r’(s))T +r(s)(KN) = 0. 


Take the dot products of both sides with N to establish that xr(s) = 0 and with 
T to establish that r(s) # 0. 

3.17 Let a(s) —- p = aT +bN +cB so that T-(a-—p) =a, N-(a—p) = b, and 
B-(a—p) =c. Recognize that (a — p) - (a —p) = R? since a lies on a sphere of 
center p and radius R. Take derivatives of both sides of this equation to obtain 
an expression for T’-(a—p). Then take derivatives of both sides of T-(a—p) =a 
to obtain an expression for N - (a —p). Finally, take derivatives of both sides of 
N - (a —p) = b to obtain an expression for B - (a — p). 

3.18 Use the previous problem. Let the constant be R? and show that a+tN + 1(4)'B 
is a constant. 

4.1 If the road is not banked, a’’(t) can be resolved into two components: (1) tan- 
gential acceleration = T(t) = 0 since the car is traveling at a constant speed; 
and (2) centripetal acceleration = xv” N(t). By Newton’s Law, the magnitude of 
the force due to centripetal acceleration is |mxv?N(t)| = mxv*, which must be 
balanced by the force due to friction, mg. 


If the road is banked, there are three primary forces acting on the car: (1) a 
downward force mg due to gravity; (2) a corresponding normal force exerted by 
the road; and (3) a kinetic frictional force preventing the car from flying off the 
road. The static frictional force preventing the car from sliding downward is 
negligible. 

Recall from physics that |f| = y|N|, yielding fe = wNy and fy = wNz. Summing 
the vertical forces on the car yields 


Ny = mg + fy = mg + LNs. 


The total of the horizontal forces, fz + Nz, produces the centripetal acceleration, 
so we have 
mv” = fr +Nz = wNy + No. 
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Solve simultaneously for Nz and Ny to obtain 


Nz = — pg] 


in? 
ey : 
Ny = itn a [g + pxv”]. 


An expression for tan@ may now be obtained. Solve this expression for vy and 


obtain 
~ | g(tand +p) 
~ V «(1 — ptan@)° 


4.2 Usex= oe for both parts of this problem. In the case of the general plane 


curve a(t) = (x(t), y(t)), we have a(t) = (2'(t),y'(t)) and a(t) = (x(t), y""(t)), 
leading to: 
[(e'(t))? + (WO) 
4.5 As part of Exercise 4.4, we have |a’(t)| = V2cosht. 


5.1 Since 7(s) is not sara a unit speed parametrization, use Ky = Be 


(1) Show that 7’ x 7" = KB —Kcos@(u x N). 
(2) Show that u x N = cos6B — sin 6T. 

(3) Show that |7’ x 7"| = csin@. 

(4) Show that |y’| = sin@. 

5.2 (=) 6 acircular helix > y a circle > «+ is constant. A circular helix is a special 
case of a cylindrical helix. Thus, T’- u = cos@ is constant. What do these results 
imply about k = K+ sin? 6? Finally, use the fact that for a cylindrical helix, = is 
a constant. 

(<=) 7 and « constant > 2 = cot@ is constant. Use this to show that ky is 
constant. Also show that 7 = : and conclude that 7 is a circle. 


2.19 Use the fact that kK = 7 = Tan ay (Exercise 2.10) to show that cot@ = Z isa 


constant. 
2.20 Use race to compute k = i Use Seaeae to compute tT = —8. What is 
true if both 7 and x are constants? 
5.5 Prove that 7 constant > 4b* = 9a? by using MAPLE. 
5.6 WLOG assume that @ has unit speed. Show that wt) -u = 0 or, equivalently, 
4(T'(s) -u) = 0. Use the fact that (T(s)-u)! = T’(s)-u+T(s)-u' =T'(s)-u 
since u is a constant vector. 


CHAPTER 2: SURFACES 


1.1 (=) If zu and zy are linearly dependent, then zy, = cry where c is a scalar. Use 
the fact that xu x 2y may be expressed as a determinant and use properties of 
determinants. 

(<=) tu X Zy = 0 => |ru||zv|sin@ = 0. What does this imply about 6, the angle 
between ty and Ly? 


1.2 


1.4 


2. 


-_ 


2.2 


2.4 


3.3 


4.2 
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Use a Monge patch, z(u,v) = (u,v, ue + v’); determine the u-parameter curve, 
x(u, vo), and the v-parameter curve, z(uo,v). Note that each of the parameter 
curves lies in a coordinate plane of R?. 

x(u,v) = (u,v, +V1 — u2 — v2), 

A ruling patch for a cone is of the form z(u,v) = p+ v6(u) where p is a fixed 
point. Let p = (0,0,0) as the cone emanates from the origin. The line that is 
to sweep out the surface must thus extend from (0,0,0) to a point on the circle 
(acos u, asin u, a) lying parallel to and above the ry-plane. (The z-coordinate of 
any point on the circle is a because z = 2? + y? = V a? cos? u + a? sin? u = a.) 
A ruling patch for a cylinder is of the form z(u,v) = G(u) + vq where q is a fixed 
direction vector. The directrix G(u) for a standard cylinder is the unit circle 
in the ry-plane (cos u, sin u,0). We want a standard, right circlular cylinder, so 
q = (0,0,1). 

To show that a surface is doubly ruled, we need to identify two ruling patches for 
the surface. Since z = zy = f(x,y), we can use a Monge patch z(u,v) = (u, v, uv) 
and write z(u,v) in terms of G(u) = (u,0,0) and 6(u) = (0,1,u). Alternatively, 
y(u, v) = (v,u, vu) is also a patch for the surface. 

A patch for the helicoid is x(u, v) = (0,0, bu) + v(acosu, asin u). 

A directrix for the hyperboloid of one sheet is the ellipse G(u) = (acosu, 
bsin u,0). Let 6(u) = B’(u)+(0,0,c). It can be shown that 2(u,v) = G(u)+v6(u) 
is indeed a patch for the hyperboloid. Alternatively, let G(u) be as above and let 
6(u) = B’(u) + (0,0, —c) and verify that this, also, is a patch for the hyperboloid. 
By definition, 


ofa] = <(f9(a(t)) a0= VFal) -v. 


Express V fg as (242, ae ofa) ) and recognize that alfa) = Slo + ga fF. 


Finally, collect like terms to obtain the desired result, v[ fg] = v[f]g + fig]. 


c = f(pi,p2,p3) and v = (v1,v2,03). Thus, by definition, v[z] = 
(Sx HE SE) - (v1, 02,03). But since x(p1,p2,p3) = pi, #2 = PF = 0 and 


ie = 1. A similar procedure may be used for v[y] and v[z]. 

Let a(t) = x(a1(t),a2(t)), B(t) = x(bi(t), be(t)) with a(0) = p = B(0) and 
a’(0) = v, B'(0) = w. Then if y(t) = x(ai(t) + bi (t), a2(t) + b2(t)), Y(t) = 
ty 4 +Zy qe = ty (a}(t) +b4(t)) + rv(ag(t) + b4(t)). Finding a’(t) and 6’(t) and 
substituting yields y'(0) = v+w. Thus, (v+ w)[f] = 7/(0)[f] = Vf-7/(0). By 
using v[f] + w[f] = Vf-v+ Vf -w, show that (v + w)[f] = v[f] + w[f]. 

To compute the eigenvalues of a matrix S, we set det(AJ — S) = 0. This yields, 
in the case of the 2 x 2 symmetric matrix 


a b 

be He? 
the equation = (a+c)A+ac— b? = 0. Solve for A, and Ag and show that they 
are both real. 


For the first part, just use S(a’) = —V,/U. For the second (which is also an if 
and only if), show that both S(a’) and a’ are in PT ,,(4)M for each t. 
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1.1 


1.4 


1.7 


2.13 


2.15 


2.18 


2.19 


3.3 


CHAPTER 3: CURVATURE(S) 


Since K = k kg, ki and ko must be of opposite sign. Because k)(u) is defined 
to be the maximum curvature, k1(u) > ke(u), so ki > 0 and kg < 0 is the case 
here. 

(1) Euler’s formula states that k(u) = cos? 6k, + sin? @kg where u = cos @u, + 
sin 0ug (i.e. u is a function of 6). Thus, 


1 27 1 27 9 9 

aap k(0)d0 = x= | (cos* 0k, + sin* 0k2)dé. 

Evaluate the integral, remembering that k; and k2 are constants. 

(2) Express v1 and v2 in terms of uv, and uz. That is, v1 = cos gu, + sin dug 
and v2 = cos(¢ + %)u, +sin(¢+ 5)ug. Use Euler’s formula to obtain k(v1) and 
k(v2). 

M minimal > H(p) = 0 for every p € M. What does this imply about k, and 
kg and, in turn, about K? 

(a) Compute xu, Ly, and ry X Ly to obtain U = xd+v5 x8 Show that 


(U-tuy)® = ee a To do so, you will need to recall that a-(b xc) = 
—b-(axc). Use the Lagrange Identity to show that |ru x ay|" = EG-F?=W?. 


Finally, show that K = View) and combine with the above. 

(b) From Exercise 3.11, a ruling patch for the saddle surface is z(u,v) = (u, 0, 0) 

+ v(0,1,u). Then @(u) = (u,0,0) and 6(u) = (0,1,u). Use the results of (a) to 
=1 


obtain K = (ae? +y2 41)?" 

(c) Note that (u) = (p1, p2, p3) so that B’(u) = (0,0, 0). 

(d) Note that 6(u) = (q1, 92,93) so that 6’(u) = (0,0, 0). 

For one direction, note that U cannot depend on v only when the term v(é’ x 6) = 
0. Thus, 6’ x 6 = 0 and the formula for K of a ruled surface shows K = 0. For the 
other direction, note that Uy = —S(xv) is a tangent vector. Show that Uy-xy = 0 
(automatically!) and Uy - xu = 0 by the hypothesis AK = 0 (and the formula for 
K of a ruled surface). 

If B is a line of curvature, then 6’ -U x U’ = 6’ -U x c@’ = 0 (why?). For the 
other way, show that developable implies that U’ is perpendicular to 6’ x U which 
is also perpendicular to 8’. Then note that all these vectors are in the tangent 
plane. 

(a) and (b) are self-explanatory. In (c), use the results of part (b) in the expres- 
sions K = inn and H = OUeeP and simplify. For part (d), recognize 
(using results of (c)), that D is the numerator of K evaluated at the critical 
point (ug, v9). Since the denominator of K is always positive, D = 0 => K = 0, 
D<0=2K <0,andD>0=K > 0. What must be true of kj and kg when 
K = 0? When K < 0? What do these results imply about the surface? Two 
sub-cases correspond to K > 0. If fuu(uo, vo) is positive, k; and kg must both 
be positive. If fuu(uo, vo) is negative, ky and kg must both be negative. What 
must be true of the surface in each of these sub-cases? 

F = m= 0 for a surface of revolution. Thus, zy, and ry are orthogonal and we 
can express S(xu) in terms of the basis vectors xy and ry. Thus, let S(ru) = 
axy + bry. Compute S(ru)- ru and recognize that this is equal to 1. Compute 


3.6 


3.9 


4.2 


4.3 
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S(xu)-Zy and recognize that this is equal to m. Similarly, let S(av) = cry +dzy 
and take dot products with ry and zy. 
(a) Derive 
_ 1-u? 
~~ (1+ uze-v*)2 


by using the expression for K for a surface of revolution. Algebraically deter- 
mine when K > 0, K = 0, and K < 0. Part (b) is similar; parametrize the 
ellipse as a(u) = (R + acosu, bsinu,0) to produce the following patch for the 
elliptical torus: z(u,v) = ((R + acosu) sinv, bsinu, (R + acosu)cosv). Using 
the expression for K for a surface of revolution yields 


- ab” cos u 
(R + acos u)(b? cos? u + a? sin? u)2 


By separating variables, we obtain the expression 


ra 


Make the substitution h = 1/coshw to obtain u = f tanh? wdw. Integrate, 
then use the fact that cosh’ (¢) = In (3 +4/ pr - i) and recall that tanh z = 


_ =. Simplify to obtain w= In|} +E! - V1-h2+C. 


cf pen 
To verify the expression for Uy, recall that a pe tacacese velocity vector applied 
to a function of u and v takes the u-partial derivative of that function. Thus, 


V2, U = (xu[ui], cu[u2], ru[ug]) = Ua. 


Then, since xy and xy form a basis for Tp(M) and since Vz,, is in Tp(M), we have 
Va,U = Aru + Bry. Take the dot product of both sides to obtain Vz,U + ry = 
Aty:Xy = AE. Also recognize that 0 = ry[0] = ru[U- ru] = Ve,U -tut+U run 
and use this to show that A = —//E. Proceed in a similar manner to find B and 
to obtain an expression for Uy. 

By finding the two partial derivatives (sa), and (Sa). we obtain, as an 


equivalent expression for the righthand side, 


EuGu — 2GEvwy | EvGv | EvEv _ 2GGuu 4 GuGu 
4E?G 4G2E  4G2E  4E2G = 4G?E 4G?E 


Combine these terms over the common denominator 4E*G?. Next, compute 


2 ( Be )= Ew _ EvEvG — EEyGy 
VEG/ VEG 2(EG)2?  2(EG)? 
and 


>. ( Gu )= Guu GubuG GukEGy 


VEG) VEG 2%EG)} 2(EG)3 


Substitute into the given expression and write the result over the common de- 
nominator 4E7G? to obtain the same expression as above. 
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4.4 


5.1 


5.2 


5.4 


2.1 


A patch for a sphere of radius R is z(u,v) = (Rcosucosv, Rsinucosv, 
Rsinv). Compute ru, ry, FE, and G and substitute into the given expression 
to obtain K = 1/R?. 

(=>) If two vectors v and w are equal and are written in terms of the same 
basis vectors ru, Ly, and U, then their corresponding coefficients must be equal 
(1.€.,01 = wW1,¥2 = we, and v3 = w3). Compute v- ry and w- zy and use this fact 
to show that the two dot products are equal. Similarly, take dot products with 
Ly and U to obtain the other two equations. 

(<=) Computing v- ru, w: Zu, v- Lv, and w- Ly and using the hypotheses v- ry = 
W: Ly, V+ Ly = W: Ly, we obtain the following two equations: 


(v1 — w1)E 4+ (v2 —we)F =0 
(v1 — wi)F + (ve — we)G = 0. 


Solve simulataneously and use the fact that, in R?, EG — F? + 0 to obtain 
vy = wy, and v2 = w2. Use the hypothesis v- U = w-U to show that v3 = wg. 
Since Sp is a linear transformation from Tp(M) to itself, we can write S(xu) = 
Arzu + Bry. But since p is an umbilic point, we have S(ru) = kry => B= 0. 
Now use 


l= S(ru)-tu = AE+ BF, m=S(ru)-ty = AF+ BG 


and solve for B to get B= ee =0, so —Fl+ Em =0orl/E=m/F. Do 
the same for S(zy). 


A patch for a surface of revolution is given by z(u,v) = (u,h(u) cosy, 
h(u)sinv). Then K = GEMS = 0 => —-h” = 0. Thus, A(u) = Cyu t+ 


Cz (a line). Note that, if C] = 0, a cylinder is generated by revolving h(u) about 
the x-axis; if C; #0, a cone is generated. 


CHAPTER 4: CONSTANT MEAN CURVATURE SURFACES 
Use a Monge patch z(u,v) = (u,v, f(u,v)) to obtain fu = g’(u), fuu = 9" (u), 
fu =h'(v), fuv =0, and foy = h'(v). Then 
2 2 
H=0(1+h"(y))g9"(x) + (1 +9" (2))h"(y) = 0. 
Separate variables to obtain 


—g" (x) _ hy) 


L+g(x)  1+h(y) 
Since x and y are independent, each side is constant relative to the other side. 
Thus, let 
_ 9" (@) 
1+9'*(z) 


Also let w = g'(x) so that g(x) = 4% and integrate to obtain w = ga = —tanaz. 
Integrating again gives g(z) = 4 In(cos az). Apply the same reasoning to the 


2.4 
3.2 


4.1 


1.7 


2.6 
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other side of the original differential equation to obtain h(y) = —1 In(cos ay). 


Combining terms yields 
1 cos ax 
f (x ’ y) =-In . 
a cos ay 


oe = (p' x 6)'- 6’ + (@' x 6) -6”. Both terms in this expression are zero. 
Compute 


Ou” G+ f+ 
and 
av (1+ f2+ f2)2 


Then apply Green’s Theorem: 


llete av = | Pdv— Qa. 


Compute partial derivatives to obtain 
Ba t Oy 7 Ve (Ux tu) + Wo (UX tu) +V- [Ur xX ty — Uy X Dy]. 


Since U is a function on M, we have Uy = Va2,U = —S(av) and Uu = V2,U = 
—S(xru). Substituting in the above equation and yields 


— + =W-(U xX tu) +Vu-(U X tv) + V: (2H ty X av). 
Now apply Green’s Theorem. 


CHAPTER 5: GEODESICS, METRICS AND ISOMETRIES 


A parametrization of a right circular cylinder is given by z(u,v) = (Rcosu, 
Rsinu,bv). Then a curve on the surface is given by a(t) = (Rcosu(t), 
Rsin u(t), bv(t)). Find a” iy Ciforentiainig twice and noting that the chain rule 
gives 4 cosu(t) = —sinu9?. Now, a” = alan + (a” -U)U, where U in this case 
is (cos u, sin u, 0). dakine the dot proeuet of a” with U yields a” -U = —R(4)?. 
From this and ae a” = atsn + (a&” -U)U, we know that otan = 0 results in 
cy = 0 and 2 ay = 0, yielding u(t) = kit +c, and v(t) = kot + cg. Thus, 
a(t) = (Rcos(kit + c1), Rsin(kjt + ci), b(ket + c2)). Finally, consider the follow- 
ing cases: (1) cy = co = 0, ky = kg = 1; (2) ky = 1, ko = 0, cy = 0, co #0; (3) 
ky = 1, ko = 0, cr =90, co =f. 


VG sind = VGcos(1/2 — ¢) = Xv: a’ = Gu’ 


since a! = xyu’ + xvv’. Now use the relation v’ = c/G derived from the second 
geodesic equation. 
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2.7 In polar coordinates, a patch for the plane is given by z(u,v) = (ucosv, usinv). 


2.8 


4.3 


1.3 


1.4 


1.5 


1.6 


Compute EF = 1, F =0, and G=vu? , verifying that x is u-Clairut. Then we have 


MY = JOE > 4“ edu 
v(u) — v(uo) = SS ' 
3a Oy Ge oF G~—c? up UV U2 — c2 
Integrate using the substitution u = cseczr > du = csecxtanxdz to obtain 
v(u) — v(ug) = £cos~! £,oru ae — ug) =, the polar equation of a line. 
Compute FE = 2, F = 0, and G = u?, verifying that the patch for the cone is 
u-Clairut. Then 


See = _ cVE u e/2du 
— v(ug) = Sad Gratninlie 
uo VGVG—e2 G —c? up UV U2 — C2 
Integrate using the substitution u = csecx to obtain v(u) — v(uo) = 


V2sec™! 4%, 
Compute FE = 1/( —u’/4)?, F =0, and G = u? /(1 —u?/4)”. Then, K = 


“st (8 (i) *& (5) ~-ss (8 ($e) 


Find the required derivatives and work through the algebra to obtain K = —1. 


CHAPTER 6: HOLONOMY AND THE GAUSS-BONNET THEOREM 


A patch for the torus is given by z(u,v) = ((R+rcosu) cosv, (R+rcosu) sin v, 
rsinu). Find ru, zy, and compute |ry x xy| = r(R+rcosu). Then 


2m lan 
SA= i; i; r(R+rcosu)dudu = 4n7rR. 
0 0 


(1) A patch for a surface of revolution is given by z(u,v) = (u,h(u) cosy, 
h(u)sinv). Find zy, zy, and compute |ry x ry| = A(u)(1 + hi?(u))?. Rec- 
ognize that for a surface of revolution, h(u) is usually written as f(x). Use the 
expression for surface area to complete the exercise. (2) Define a Monge patch 


(u,v) = (u,v, f(u,v)). Then |ru x ey| = J/1+ f2 + fe. 
A patch for the bugle surface (with c = 1) is given by 


z(u,v) = (u— tanhw, sech wcosv, sech usin v). 


Compute |2u x zy| = (sech* u tanh? u+sech? u—2sech* u+sech® u)? and simplify 
using the identity tanh? u = 1—sech? u, obtaining lau X2y]| = sech utanhu. Then 


[o-e) 27 
SA= [ i sech utanhudvudu = 2r. 
0 0 


(1) Total Gaussian curvature is given by 


COs u 
d 
[x =| lap ea eo eiat 


for the torus, where |zu x zy| = r(R+rcosu). (2) A patch for the catenoid is 
given by 2(u,v) = (u, cosh ucos v, cosh usin v), yielding |ru X ry] = cosh? u. Also, 


3.1 
3.2 


3.7 


3.8 


4.1 


5.3 


5.4 


6.3 


6.6 
7.8 


1.2 


1.8 
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K=-1 j cosh? u. Integrate f[ am & to show that the total Gaussian curvature is 
—Arn. 

a’ [V -V] =2VqyV-V =0 since V is parallel. 
a[V-W]=VaV-W+V-Vq/W. V and W parallel imply a/[V - W] = 0 and 
V parallel, a’[V -W] = 0, a'[W -W] = 0 imply VW is perpendicular to both 
V and W ina plane. Thus, V,/W = 0 and W is parallel. 

For the R-sphere, K = 1 i. R? and |\Zu X Ly| = R? cosv. Integrate [ mu i to show 
that the total Gaussian curvature above ug is 2m — 27 sin vg. By Exercise 10.9, 
the holonomy around the vo-latitude curve is —27sinvg. Thus, the holonomy 
around a curve is equal to the total Gaussian curvature over the portion of the 
surface bounded by the curve (up to additions of multiples of 27). 

At the Equator, v9 = 0. What is the holonomy along the Equator and what 
does this imply about the apparent angle of rotation of a vector moving along 
the Equator? What does this signify about the Equator? 

But, of course, gravity really doesn’t point that way on a planetary torus, does 
it? 

The vector must come back to itself, so the total number of revolutions it makes 
is a multiple of 27. 

Note that the Gaussian curvature for both H and P is a constant K = —1. Thus 


we have 
i K=-f = —area of A. 
A A 


But, since the sum of the interior angles of the triangle differs from m by (+ or 
—) the total Gaussian curvature, we have 


dt; —nm =-—area of A. 


What does this imply about the sum of the angles, noting that area is a strictly 
positive quantity? 

If K < 0 and K < 0 at even a single point, then the total Gauss curvature is 
negative. But the Euler characteristic of the torus is zero. 

A disk has Euler characteristic 1. 

Their curvatures are not bounded away from zero. 


CHAPTER 7: MINIMAL SURFACES AND COMPLEX VARIABLES 


2 


For Cauchy-Riemann, z? = x? — y” + i2zy, so 


Thus, f(z) = z* is holomorphic and f'(z) = 2x + i2y = 2z as it should. 


OF seed ie O09 , 200 
33-2 'du a ima 
Pa OW Ob 
~ Ou Be Ou Oe 


by Cauchy-Riemann. 
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3.5 The calculations are exactly as in Example 3.1 except that an extra factor of 
i occurs in each term. This affects the real parts to produce z! = sinhusin v, 


zg? = —sinhucosv and 2° = v; thus, a helicoid. 
3.21 M is minimal with isothermal coordinates, so 1 = —n and, consequently, 
_ In — m? _ se a _ Pm 
~ EG-—F2” F2-0Q ~— E2 


3.22 A conformal Gauss map implies Uy, - Uy = 0. Plugging in the usual expressions 
for Uy and Uy, we get 0 = mH. Now consider the two cases, m = 0 and H = 0. 
4.7 Use Exercise 4.2 with B = (0,1,0). For instance, x1 = Rela’(z) + if Odu] 
= Re@(z), 2? = Rela®(z) + ifOdw] = Rey(z), 22 = Rela*(z) + 
if1- la’(w)|dw] = RelO0 + if /6'(w)2 + y'(w)? dw 
Im f Bw)? +7 (wy? dw. 


CHAPTER 8: THE CALCULUS OF VARIATIONS AND GEOMETRY 


1.4 


Bt Oc | Of 4 Oz dt OF 
=o+04e( Ff - 5 >L) 


Ox atdz 


if and only if $f — 2 9f — 
1.8 x(t) =t-—sint. 


1.10 x(t) = sint. 
2.1 The Euler-Lagrange equation for the time integral 


Vity’” 


is 


Then 
1 
- = C 
kyyity' : 
is separable with 
Y ___ qy = dy, 


Ja —y 
The solutions are then (x — a)* + y? = c”, circles centered on the x-axis — the 
geodesics of the Poincaré plane! 
3.1 x(t) =t—sint+b. 
3.3 z(t) =sint+c. 
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3.7 
tees 1.9 ; 1 9 2 . 
E=xsint+ 5¢ —asint — 5p +2° —(£—p)p 


(z —p)? 


A 
2 
0 


iV 


3.9 


* _ (& — p)2p 


= £7 — p* — 2pt + 2p” 


E=2#?-27*-p'+2 
‘ 2 
= («—p) 
>0 
4.2 The Euler-Lagrange equation is 
#-d-F(e+2)=0 
with simplification ¢ = —A and solution 
r 
x(t) = ~5r +at+b. 
The initial conditions give a = \/2 and b = 0. Applying the constraint, we obtain 


a x 
12> 2 ; t oa ac 


so that \ = 7. Finally, z(t) = —$t? + at. 
4.8 The equations of motion for the particle in the paraboloid are 


(1+4u*)i+ duu? +2u-us?=0 v=-. 
Uu 


4.9 T = m/2(Eu” + Gi), so (forgetting m) 
E = 1/2(Eu? + Gi* — Ep? — Gp3) — (u — pi) Epi — (0 — p2)Gpe 
} ; ‘ 
= 5 (E(u — pi)” + G(— p2)”) 
>0 
CHAPTER 9: A GLIMPSE AT HIGHER DIMENSIONS 
3.3 


[fV, gW] = fV[gW] — gW[fV] 
= fV[g|W + foVW — gW[flV — gf WV 
= fglV,W] + fV[gl\W — gWI[fIV. 
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3.3 Assume Xy -Xy = 0 and take an orthonormal basis xy = xu/ VE and xy = 


xv /VG. Then 
1 x 
vase)” = 7a ((75) ) 
Wake) = 7 \\ a), 
aie aN 
-, XuuVE oes Xu(VE)u 
~ E3/2 
af 
E 
Similarly, (Vx, Ky) =n/G. Then, the sum is 
Gl+En 
EG. et 
since F' = 0. 


5.10 Note that (R(X, X)X, X) = 0, so that, if we take X/|X| as Ep, then €1,... , Ex 
is a frame for M*. By definition then, 
k-1 
Ric(X, X) = Ric(Ex, Ex) = S> (REx: Ej )Exs Ey) 


j-1 
since (R(Ex, Ex)Ex, Ex) = 0. Then use the definition of sectional curvature and 


the definition of E&, = X/|X]. . 
5.13 Let X = 5), XE; and df = a df;0’ where & is a dual vector space basis 


element to €; (see §6) defined by 0 (E;) = 6 (where 6 is the Kronecker delta). 
Then 


df (X) = df()> X7E;) = S~ X%df(E;) = S> S> X7dfi6 (5) = S> XM df. 
i) J j 7% j 
Also. we have by the definition of divergence, 


div(f(-,-))(X) = De (F (5+) (Ej, X) 
J 
= > Esl F (Es, X)] — f(VE,Ej,X) — f(E;, Ve; X) 
J 
= Do EsLAM ES X) + FE) (Ej X) 
J 


_ f(Ve,€5,X) — F(E;, Ve, X) 
Jj 


_ f(Ve, Ej, X) a f(E5, Ve; X) 
= 7 SIME X) 
Jj 


= 2 df (E;)X! 


Jj 

= Saf; x? 
j 

= df(X) 
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k k 
K= S Ric(Ei, €) = > MERGED = kd. 
i=1 } 


i=1 


Thus, dk = kd) and we also have (by Exercise 5.13) 
2dr = 2div f(-,-) 


= 2div(Ric) 
= dk by Theorem 5.4 
= kd. 


Thus, (k — 2)d\ = 0 and, since k > 3, we must have dX = 0. Hence, X is a 
constant. 
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